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Preface 


This is the first of a series of three volumes (the other ones being [4] and [5]) 
devoted to the mathematics of mathematical olympiads. Generally speaking, they 
are somewhat expanded versions of a collection of six volumes, first published 
in Portuguese by the Brazilian Mathematical Society in 2012 and currently in its 
second edition. 

The material collected here and in the other two volumes is based on course 
notes that evolved over the years since 1991, when I first began coaching students 
of Fortaleza to the Brazilian Mathematical Olympiad and to the International 
Mathematical Olympiad. Some 10 years ago, preliminary versions of the Portuguese 
texts also served as textbooks for several editions of summer courses delivered at 
UFC to math teachers of the Cape Verde Republic. 

All volumes were carefully planned to be a balanced mixture of a smooth and 
self-contained introduction to the fascinating world of mathematical competitions, 
as well as to serve as textbooks for students and instructors involved with math clubs 
for gifted high school students. 

Upon writing the books, I have stuck myself to an invaluable advice of the 
eminent Hungarian-American mathematician George Pélya, who used to say that 
one cannot learn mathematics without getting one’s hands dirty. That’s why, in 
several points throughout the text, I left to the reader the task of checking minor 
aspects of more general developments. These appear either as small omitted details 
in proofs or as subsidiary extensions of the theory. In this last case, I sometimes 
refer the reader to specific problems along the book, which are marked with an * 
and whose solutions are considered to be an essential part of the text. In general, in 
each section, I collect a list of problems, carefully chosen in the direction of applying 
the material and ideas presented in the text. Dozens of them are taken from former 
editions of mathematical competitions and range from the almost immediate to real 
challenging ones. Regardless of their level of difficulty, we provide generous hints, 
or even complete solutions, to virtually all of them. 

This first volume concentrates on real numbers, elementary algebra, and real 
functions. The book starts with a non-axiomatic discussion of the most elementary 
properties of real numbers, followed by a detailed study of basic algebraic identities, 
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equations and systems of equations, elementary sequences, mathematical induction, 
and the binomial theorem. These pave the way for an initial presentation of algebraic 
inequalities like that between the arithmetic and geometric means, as well as those 
of Cauchy, Chebyshev, and Abel. We then run through an exhaustive elementary 
study of functions that culminates with a first look at implicitly defined functions. 
This is followed by a second look on real numbers, focusing on the concept of 
convergence for sequences and series of reals. We then return to functions, this 
time to successively develop, in detail, the basics of continuity, differentiability, 
and integrability. Along the way, the text stays somewhere between a thorough 
calculus course and an introductory analysis one. Lots of interesting examples and 
important applications are presented throughout. Whenever possible (or desirable), 
the examples are taken from mathematical competitions, whereas the applications 
vary from the proof and several applications of Jensen’s convexity inequality to 
Lambert’s theorem on the irrationality of z and Stirling’s formula on the asymptotic 
behavior of n!. The text ends with a chapter on sequences and series of functions, 
where, among other interesting topics, we construct an example of a continuous and 
nowhere differentiable function, develop the rudiments of the generating function 
method, and discuss Weierstrass’ approximation theorem and the rudiments of the 
theory of Fourier series. 

Several people and institutions contributed throughout the years for my effort 
of turning a bunch of handwritten notes into these books. The State of Ceara 
Mathematical Olympiad, created by the Mathematics Department of the Federal 
University of Ceara (UFC) back in 1980 and now in its 36th edition, has since 
then motivated hundreds of youngsters of Fortaleza to deepen their studies of 
mathematics. I was one such student in the late 1980s, and my involvement with this 
competition and with the Brazilian Mathematical Olympiad a few years later had a 
decisive influence on my choice of career. Throughout the 1990s, I had the honor 
of coaching several brilliant students of Fortaleza to the Brazilian Mathematical 
Olympiad. Some of them entered Brazilian teams to the IMO or other international 
competitions, and their doubts, comments, and criticisms were of great help in 
shaping my view on mathematical competitions. In this sense, sincere thanks go to 
Joao Luiz Falcio, Roney Castro, Marcelo Oliveira, Marcondes Franga Jr., Marcelo 
C. de Souza, Eduardo Balreira, Breno Falcao, Fabricio Benevides, Rui Vigelis, 
Daniel Sobreira, Samuel Feitosa, Davi Maximo Nogueira, and Yuri Lima. 

Professor Joao Lucas Barbosa, upon inviting me to write the textbooks to the 
Amilcar Cabral Educational Cooperation Project with Cape Verde Republic, had 
unconsciously provided me with the motivation to complete the Portuguese version 
of these books. The continuous support of Professor Hildrio Alencar, president of 
the Brazilian Mathematical Society when the Portuguese edition was first published, 
was also of great importance for me. Special thanks go to professors Abdénago 
Barros and Fernanda Camargo, my colleagues at the Mathematics Department of 
UFC, who had made quite useful comments on the Portuguese editions, which 
were incorporated in the text in a way or another; they had also read the entire 
English version and helped me in improving it in a number of ways. If it weren’t for 
my editor at Springer-Verlag, Mr. Robinson dos Santos, I almost surely would not 
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have had the courage to embrace the task of translating more than 1500 pages from 
Portuguese into English. I acknowledge all the staff of Springer involved with this 
project in his name. 

Finally, and mostly, I would like to express my deepest gratitude to my parents 
Antonio and Rosemary, my wife Monica, and our kids Gabriel and Isabela. From 
early childhood, my parents have always called my attention to the importance of 
a solid education, having done their best for me and my brothers to attend the 
best possible schools. My wife and kids filled our home with the harmony and 
softness I needed to get to endure on several months of solitary nights of work 
while translating this book. 


Fortaleza, Brazil Antonio Caminha Muniz Neto 
December 2016 
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Chapter 1 
The Set of Real Numbers 


This first chapter recalls some definitions and results which are essential to all 
further developments. We assume from the reader a modest acquaintance with the 
most basic concepts of set theory; we also assume that he or she is familiar with the 
sets of naturals, 


N = {1,2,3,4,...}, 
integers, 
Z = {0,+1,+2,+3,...}, 


and rationals, 


a 
Q= 5 abeZ,b¢ 0h, 
as well as with the elementary arithmetic operations within these sets. 

In what concerns the integers', given a,b € Z, with a £ 0, we say that a divides 
b if there exists an integer c such that b = ac. Equivalently, to say that a divides b is 
the same as to say that the rational number 2 is an integer; for example, 13 divides 


52, since EE = 4, 

If a divides b, we also say that a is a divisor of b, or that b is divisible by a; in 
such a case, we denote a | b. If a doesn’t divide b (or, which is the same, if 7 ¢ Z), 
we denote a + b. An integer n is even if 2 | n; otherwise, n is said to be odd. Hence, 


0, +2, +4, +6, ... are the even integers, while +1, +3, +5, ...are the odd ones. 


We refer the reader to Chap. 6 of [5] for a systematic discussion of what follows. 


© Springer International Publishing AG 2017 1 
A. Caminha Muniz Neto, An Excursion through Elementary Mathematics, Volume I, 
Problem Books in Mathematics, DOI 10.1007/978-3-3 19-5387 1-6_1 


2 1 The Set of Real Numbers 


Given natural numbers a and J, it is well known that there exist unique integers 
q and r satisfying the following conditions: 


b=aq+r and 0O<r<a. (1.1) 


The above relation is known as the division algorithm, and the integers q and r 
are respectively called the quotient and the remainder of the division of b by a. 
Conditions (1.1) are usually condensed in the diagram 
b a 
Ps 


q 


In particular, for given naturals a and b and in the above notations, a | b is the 
same as r = 0 andg = B. 

Two nonzero integers a and b always have a greatest common divisor, which 
will be denoted gcd(a, b); Moreover, a and b are said to be relatively prime if 
gced(a, b) = 1; in particular, if b = ka + 1, then a and b are relatively prime. 

If r = “, with m and n integers, is a representation of the nonzero rational 
r, then, by cancelling out factors common to m and n (i.e., cancelling gcd(m, n) 
out of m and n), we get an irreducible representation for r. For example, since 
gcd(—12, 18) = 6, the rational number =? has = as an irreducible representation, 
which was obtained by cancelling out a factor 6 from both —12 and 18. 

An integer p > | is prime if 1 and p are its only positive divisors; in another way, 


an integer p > | is prime if, for a € N, we have that 


PeN =>a=lorp. 

a 

It is a well known fact (cf. Chap. 6 of [5]) that the set of prime numbers is infinite. 
Below, we list all prime numbers less than 100: 


2, 3,5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 
59, 61, 67, 71, 73, 79, 83, 89, 97. 


An integer greater than | and which is not prime is said to be composite. It is also a 
well known fact (cf. Chap. 6 of [5]) that every natural number > 1 can be written 
(or decomposed) as a product of a finite number of powers” of prime numbers 
(its prime factors); also, such a representation of n is unique, up to the order of the 
powers. For instance, 9000 = 23-37-5? is the decomposition of 9000 as a product of 
powers of primes. The existence and uniqueness of such a decomposition of n > 1 
is known as the Fundamental Theorem of Arithmetic. 


To recall the definition and the main properties of powers of numbers, we refer the reader to 
Section 1.2. 
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1.1. Arithmetic in 


We use to represent rational numbers in decimal notation. For the rational number 
= for example, we write : = 0.125 as a shorthand for the equality 


1.4 Fa 2 a 5 
8 10 10? 103” 
and say that 0.125 is the decimal representation of : 
Some rational numbers have more complicated decimal representations. We take 


as an example the rational +: for which we usually write 


1 
— = 0.08333... 
12 


What does this equality mean? Arguing as in the case of 4, we are tempted to say 
that this equality is a shorthand for 


ne ae oe eee (1.2) 

12 102 103-104 105 
This is actually so, provided we correctly interpret the sum with an infinite number 
of summands at the right hand side. Rigorously, (1.2) means that, if we fix the a 


priori maximum error i = 0.00...01, then we have 
—— 


n 


2 1 8 n 3 4 3 dees ak 3 Ee 1 
12 102, 10-104 10k} ~ 10" 
for every natural number k > n; yet in another way, (1.2) means that all of the 
numbers 3, + 4; + 4; +--+ + 72, with k > n, approximate 7, by defect with 


0 
error less than or equal to 0. .O1. In fact, it will follow from Proposition 3.12 


00.. 
—— 


that 
1 8 n 3 ‘ 3 ood: ee 
12 102° 103-104 10k} ~~ 3- 10K’ 


so that the error in the defect approximation > + a + w ferret a of s equals 


ir , which is always less than or equal to the maximum error ~ whenever k > n. 


It is precisely in this sense that we should think of the equality 4 = 0.08333... 


4 1 The Set of Real Numbers 


In the light of the above discussion, one question suggests itself naturally: for an 
arbitrary sequence? (a\, 42, 43,...) of decimal digits, can we think of 


0.a,a2a3 eee 


as the decimal representation of some rational number? 

It is possible to prove (and we will do so in Problem 2, page 7) that the answer 
to this question is yes if and only if the list (a, a2, a3,...) is periodic from a certain 
point on, i.e., if and only if it is of the form 


(a1, 2,..., 41,1, b2,..., bp, b1, bo, ..., bp, bi, b2,..., Bp, .- »). (1.3) 
Sa nn nr Oe” 
P P P 


(In particular, for the rational number + the list is (0, 8, 3, 3,3,...), which is clearly 
periodic.) Therefore, if we are able to exhibit a list of digits that is not periodic from 
any point on, we will conclude that the general answer to the question posed above 
is no! We show an example of such a list now. 


Example 1.1 The sequence of digits (0,1,0,1,1,0,1,1,1,0,...), with infinitely 
many 0’s and such that the quantity of digits 1’s after each digit 0 equals the previous 
quantity of digits 1 plus one, is not periodic from any point on. 


Proof By the sake of contradiction, suppose that the list in the statement is periodic 
from some point on, with, say, a block of p digits that repeats itself. The way we 
defined the sequence assures that, from some point on, each occurrence of a block of 
digits 1 would bring more than p digits 1. Therefore, the block of digits that repeats 
itself should be composed only by digits 1, so that the list should be, from some 
point on, equal to (1,1,1,1,1,1,1,...). However, if this was so, then we could not 
have an infinite number of digits 0, which is a contradiction. Oo 


In short, the previous discussion points to the following deficiency of the set 
of rationals: every rational number admits a decimal representation, but not every 
decimal representation represents a rational number. At this point, in order to fulfill 
this deficiency, we postulate’ the existence of a set R, containing Q and having the 
following properties: 


3As we shall see in Section 6.1, a sequence of real numbers is just a function f : N — R; however, 
for our purposes here, we can think of a sequence just as an ordered list of numbers, i.e., a list of 
numbers in which we specify which is the first number, which is the second, third, and so on. We 
will sistematically discuss some important elementary sequences of real numbers in Chap. 3. 


“An axiom or postulate in a certain theory is a property imposed as true. One of the fundamental 
characteristics of Mathematics as a branch of human knowledge is the use of the axiomatic 
method, i.e., the acceptance of the fact that not every mathematical property can be logically 
deduced from previously established properties, being necessary the adoption of an adequate set 
of axioms. 
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(1) Addition, subtraction, multiplication and division in Q extend to similar 
operations on R, in the sense that they have, on R, properties similar to those 
on Q. 

(11) The order relation within Q extends to an order relation within R, so that it has, 
in R, the same properties its restriction has in Q; in particular, every element 
of R is negative, zero or positive. 

(III) In the sense of our previous discussions, to every sequence (a), a2, a3,...) of 
digits there corresponds a unique element x € [0, 1], which will be denoted by 
x = 0.a)a2a3.... Conversely, to every x € [0, 1], there corresponds a (not nec- 
essarily unique) sequence (qd), a2, 43,...) of digits, such that x = 0.a,a2q3.... 


The elements of R are called real numbers, and the set R as a whole is the set of 
real numbers>. 

In the rest of this section and in the next two section we discuss in detail each of 
the items (I), (II) and (III) above, showing what they really mean. 

Beginning with (I), we postulate that R is furnished with two operations, 
respectively denoted + and - and called (by analogy with the corresponding 
operations on Q) addition and multiplication. Such operations satisfy axioms (1) 
to (7), quoted below: 


(1) Consistency: for a, b € Q, the result a+ b of the addition of a and b is the same, 
whether we consider the usual addition on Q or the corresponding operation 
on R. Analogously, the result a - b of the multiplication of a and b is the 
same, whether we consider the usual multiplication on Q or the corresponding 
operation on R. 

(2) Commutativity: the operations + and - are commutative, i.e., they are such that 
a+b=b-+aanda-b=b.-a,foralla,beR. 

(3) Associativity: the operations + and - are associative, i.e., they are such that 
a+(b+c) = (a+b)+canda- (b-c) = (a-b)-c, foralla,b,c ER. 

(4) Distributivity: Multiplication is distributive with respect to addition, i.e., it is 
such that a- (b+ c) = (a-b)+ (a-c), foralla,b,c ER. 

(5) The roles of 0 and 1 in R: the rational numbers 0 and | are such that 0 +a = a 
and 1-a=<a, foreverya eR. 

(6) Law of cancellation: if a,b € R are such that a- b = 0, thena = 0 orb= 0. 

(7) Existence of inverses: for every a € R, there exists b € R such that a + b = 0. 
For every a € R \ {0}, there exists b € R such thata-b = 1. 


The properties above have the following important consequences: 


(i) Uniqueness of the additive inverse: for a given a € R, if b, b’ € R are such that 
a+b=0anda+bD' = 0, the associativity and commutativity of addition 


>There are more construtive (and rigorous) ways of introducing the real numbers, as the reader 
can find in [6], for example. However, in these notes we chose to follow an approach that was as 
close as possible to the previous experience of the medium reader, sacrificing rigor in the name of 
understanding. 
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give us 
b=b+0=b+4+(a+D) 

=(b+a)+b)' =(a+b)4+0d' 

=0+) =D’ 
Hence, the real number a has a unique additive inverse, which from now on will 
be denoted by —a, as is usually done within Q. It follows from a + (—a) = 0 
that a is the additive inverse of —a; therefore, according to the notation just 
established for additive inverses, we have —(—a) = a. 

(ii) Uniqueness of the multiplicative inverse: for a givena € R \ {0}, if b,b’ € R 
are such thata-b = 1 anda-b’ = 1, then b = DB’. The proof of this fact 
is completely analogous to that of the previous item (cf. Problem 3, page 7). 
From now on, we will denote the multiplicative inverse of a € R \ {0} by a}, 
as is also usually done within Q. 


(iii) For a € R, we have a- 0 = 0: in order to check this, let a- 0 = e. The 
distributivity of multiplication with respect to addition gives 


e=a-0=a-(04+0)=a-0+a-0=er+e. 
Hence, 
e=e+0=e+ (e+ (-e)) 
(e+ e) + (-e) 
=e+(-e) =0, 


II 


so thata-0O =e =0. 


In view of the above properties, we adopt the convention (as is usual within Q) to 
omit the sign - of multiplication, thus writing simply ab instead of a-b. Now, observe 
that the associativity and commutativity of addition and multiplication in R allow us 
to add or multiply an arbitrary finite number of real numbers without worrying with 
which summands or factors should be initially operated; the final result will always 
be the same°®. 

We also define the operations of subtraction (—) and division (~) in R as is 


usually done in Q: for a, b € R, we set 


a—b=a+(-b) and a+b=ab", 


6Rigorously speaking, the validity of such a statement should be proved as a theorem, what can 
be done with the aid of the principle of mathematical induction (cf. Chap. 5). However, in order 
to have a less terse reading, we chose not to give a formal proof of this fact, just relying on the 
previous experience of the reader. 
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with b # 0 in the last case; yet in the case of division, and whenever there is no 
danger of confusion, we write a/b or ¢ as synonymous for a + b. 


Problems: Section 1.1 


1. * Establish the following properties of proportions: if a, b, c and d are nonzero 
reals, such that 7 = ce then 


a Cc ate 


bd bt+td 


2. * Given a sequence (a1, da2,q3,...) of digits, prove that the real number 
0.a,a2a3... represents the decimal expansion of a rational if and only if the 
sequence (a1, a2, 43,...) is periodic from some point on, in the sense of (1.3). 

3. * Prove the uniqueness of multiplicative inverses in R. More precisely, prove that 
if a # 0 is a real number and b,b’ € R are such thata-b = a-b’ = 1, then 
b=bd'. 

4. Prove, from the axioms for addition and multiplication of reals, that —a = (—1)a, 
for everya ER. 

5. In each of the following items, decide whether the real number in question is 
rational or not. In doing so, assume that the pattern of digits suggested up to the 
dots is actually followed. Moreover, in case the number is rational, write it as an 
irreducible fraction: 


(a) 2.324444... 
(b) 0.12121212.... 

(c) 2.1345454545.... 

(d) 0.1234567891011121314.... 


1.2. The Order Relation in R 


We postulate the existence of an order relation on R, which amounts to a way 
of comparing real numbers. By the sake of analogy with the corresponding order 
relation on Q, we denote this order relation by >, and also read it as greater than or 
equal to. The order relation on R satisfies axioms (1’) to (5’) below: 


(1’) Consistency: for a,b € Q such that a > bin Q, we havea > binR. 
(2’) Reflexivity: a = a, for every a € R. 

(3’) Antisymmetry: if a,b € R are such that a > b and b > a, thena = BD. 
(4°) Transitivity: if a,b,c € Rare such that a > band b > c, thena > c. 
(5’) Dichotomy: for all a, b € R, one has either a > b or b > a. 
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In all that follows, if a,b € R are such that a > b anda # b, we write a > b 
and read a is greater than b. We also write a < b (read a is less than or equal to b) 
as a synonymous for b > a, anda < b (read a is less than b) as a synonymous for 
b> a. Ifa € Ris such that a > 0, we say that a is positive; if a < 0, we say that a 
is negative. 

With respect to the order relation > on R, we also impose axioms (6’) and (7’) 
below, which guarantee — as they do in Q — its compatibility with the operations of 
addition and multiplication: 


(60°) a>bsa-b>Q0. 
(7) aab>0>a+b,ab> 0. 


The next result collects some other useful properties of the order relation on R, 
which can be deduced from axioms (1’) to (7’). From now on, we say that two 
nonzero real numbers have equal signs if they are both positive or both negative. 


Proposition 1.2 Leta,b,c,d€R. 


(a) Ifa > 0, then —a < 0, and vice-versa. 
b>0>ab>0 

(b) Hfa>0,then >> eo: 

b>0>ab<0 

b<0>5ab>0° 

(d)a>b>a+c>bee. 

(e) a>b,c>d>at+c>be4+d. 


(f) Ifa > b, then | ©> ) = a> oe 


(c) Ifa <0, then 


c<0=> ac <be 

(g) a#0S>a>0. 

(h)a>0@+>0. 

(i) Ifaand b have equal signs and a > b, then + < i 


Proof We prove just a few of the above items, letting the others as exercises (see 
Problem 1, page 10). 


(a) This follows from axiom (6’): 
0>-a}0-(-a)>0@a>0. 
(d) Again by axiom (6’), we have 
a>b>a-b>0353 (at+c)-(b+c0e)>05a+cec>bHt+e. 
(e) We use (d) and the transitivity of >: 


a>b>a+c>be+ec 


bed 
Sipser 
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(f) Suppose c > 0 (the case c < 0 can be treated analogously). It follows from (6’) 
and (7’) that 


a>b>a-b>053 (a-b)c>05 ac—be>0 > ac > be. 


(h) Suppose a > 0. If we had 4 < 0, it would follow from (b) that 1 = a- 4 < 0, 
which is an absurd (for the order relation on R extends that on Q). 
(i) Since a and b have equal signs, it follows from (b) and (c) that ab > 0. Thus, 
item (h) gives + > 0. Since b — a < 0, we get from either (b) or (c) that 
1 1 b— 1 
—--—--= Le | ee, TG 
a b ab ab 
Finally, (6’) assures that this is equivalent to + < i 
O 


Items (b) and (c) of the former proposition are known as the sign rules for 
multiplication of real numbers. 

Next, let r € R be given. The square of r, denoted r’, is the real number 7? = r-r; 
the cube of r, denoted r?, is the real number r? = r-r-r. More generally, for a given 
n € N, we define the n—th power of r, denoted r”, as being r, if n = 1, or the real 
number obtained by multiplying r by itself n times, ifn > 1: 


Once more we call the attention of the reader to the fact that the associativity of the 
multiplication of reals, together with the principle of mathematical induction (cf. 
Chap. 5) allows us to prove that the result of the right hand side of the equality above 
does not depend on the order in which we multiply the n copies of r. Therefore, 7” 
is a well defined real number. Problem 3, page 10, lists some useful properties of 
the powers of real numbers. 

We collect below an important consequence of the properties of the order relation 
on R, listed in the last proposition. 


Corollary 1.3 Let r be a positive real number and m and n be natural numbers, 
with m > n. Then: 


(a) 0<r<1Sr"<r". 
(b)r>1>r">r". 


Proof 


(a) Since r is positive, multiplying both sides of the inequality r < 1 by 7, we get 
r? <r. Multipliying both sides of this last inequality by r once more, it follows 
that r> < r? and, hence, that r? < r* < r. Proceeding this way, we arrive at the 
desired result, i.e., 


weicpfcr<r <r. 
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(b) The proof of this item is essentially equal to that of (a), with the only difference 
that, in this case, we have r > 1. 
oO 


We next illustrate, in an example, how the above corollary can be useful in 
comparing certain real numbers. 


Example 1.4 In order to compare 2! + 3! and 4!°, for instance, it suffices to 
see that 


7100 ai 3100 < 3100 ai 3,100 = Dy 3,100 
= 2-33.37 = 54.37 
< 64. 47 = 43 : 47 = 400° 
In Chapter 5, we will sistematically study some important inequalities involving 


real numbers. For the time being, the following corollary — in spite of its simplicity 
— will play an important role. 


Corollary 1.5 Fora,b € R, we have 
a+b? >0, (1.4) 


with equality if and only ifa = b = 0. 


Proof Item (g) of Proposition 1.2 gives a?,b? > 0. Therefore, item (d) of that 
proposition gives a? + b” > 0. Now, let a # 0. Then, it follows from item (g) 
of the above mentioned proposition that a? > 0. On the other hand, since we still 
have b” > 0, item (e) of that proposition guarantees that a” + b? > 0. o 


Problems: Section 1.2 


1. * Prove items (b), (c) and (g) of Proposition 1.2. 
2. * Generalize Corollary 1.5, showing that, if a, b,c € R, then 


a+b’+c?>0, 


with equality if and only ifa = b=c=0. 
3. * Givenr,s € R andm,n €N, prove that’: 


(a) (rs)" = r"s". 


"In order to prove properties (a) to (d) in a rigorous way, we have to rely on the principle of 
mathematical induction (cf. Chap. 5). Thus, our intention here is simply to make the reader give 
some arguments on their plausibility. 


10. 
11. 


12. 


13. 


14. 


15. 


16. 


17. 


: Prove that  — 4 + fotteng 
. Given positive real nuibete a and b eh that a < b, compare (i.e., decide 
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(b) rn = pm, 
(c) y= a ee 
(d) (£)" =4, ifs £0. 


. * For r € R \ {0} andn e€ N, we extend the notion of powers with natural 


. . : —n _ 1 
exponents to powers with integer exponents by defining r-“" = —. For example, 


ris , res + etc. If we also set r° = 1, prove that, for all m,n € N, one 


yn — 
hasG=r"". 


. Let a and b be nonzero integers, with b > 1. If the only prime divisors of b are 


2 or 5, prove that the decimal representation of ¢ is finite. 


. *Ifx #0 is areal number andn €N, prove that x” is positive if n is even, and 


has the same ee of x if 5 is odd. 
1 
3 ‘5 To 7 3: 


2 


which is the greatest of) the numbers 2 


b 


e 


. (TT) We are given ten real numbers such that the sum of any four of them is 


positive. Show that the sum of all ten numbers is also positive. 
Decide which of the numbers 31!! or 17!* is the greatest one. 
* Let n € N anda, b be positive reals. Prove that: 


(a) a < bif and only if a? < Db’. 
(b) a < bif and only if a” < b”. 
(c) a? +b" <(a+b)". 


(EKMC - adapted) Let a, b and c be the lenghts of the sides of a right triangle, 
c being the hypotenuse. Which one is the greatest: a? + b? or c*? Justify your 
answer. 

Show that, for every n € N, we have 127 + 27" 4.32% > 2.7", 

* Find all natural numbers a, b and c such that a < b < c and + + i + i is an 
integer. 

(IMO) Explain how to write 100 as a sum of naturals whose product is as large 
as possible. 

(Russia)® The leftmost digit of the decimal representations of the natural 
numbers 2” and 5” is the same. Prove that such digit is equal to 3. 

(Russia) Let a, b, c and d be positive real numbers. Show that, among the 
inequalities 


a+b<c+d, (a+ b)(c+ 4) <ab+cd and (a+ b)cd < ab(c+ 4d), 


at least one is false. 


8For a converse to this problem, see Example 10.60. 
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1.3 Completeness of the Real Number System 


For the time being, we postulate that to every sequence (a1, da2,a3,...) of digits, 
there corresponds a unique x € R, in the following sense: for a fixed maximum 


error with n € N, we have 


1 
ed rae TO a 


for every natural k > n. In particular, it follows from this inequality that 


1 de ay : ak 1 
Xx — — eee — =F 
~ 10 ~~ 102 10 = 10" 


for every k > n. Taking k = n and recalling that a; < 9 for every j, we get 


a a2 Gn-1 an 1 


* S97 To TF Tomt 7 Tor © To 
sag eee ae 
~ 10 102 107-1 10” 10” 

9 9 9 1 
= Tot 72 tt ost t ot 
9, 9 1 

To’ 12 7° Tor? 
=-.-=], 


Thus,O<x< 1. 

As a shorthand to the above postulate, we say that the set R of real numbers is 
complete. In this respect, we refer the reader to Section 7.1, as well as to Problem 9, 
page 242, where the completeness of the real number system will be more rigorously 
discussed. 

Conversely, we also postulate that to every positive real number x there cor- 
responds a nonnegative integer m and a sequence (a1, a2, a3, ...) of digits, such 
that x = m+ 0.a,a2a3..., in the sense of the previous paragraph. If m > 0 and 
m = b,,...b,bo, with the b;’s being their digits, we write 


x= by wee bi bo.ayana3 ae 


and say that b, ...b,bo.a;a2a3.... is the decimal representation or expansion of x. 

As was seen in Problem 2, page 7, a real number is rational exactly when its 
decimal representation is finite or infinite and periodic. On the other hand, a real 
number which is not rational is said to be irrational. Thus, irrational numbers 
are those real numbers which cannot be written as quotients of two integers, 
or, in another way, those reals whose decimal representations are infinite and 
nonperiodic. 
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Up to the present, the only irrational number we have met was the number 
0,0101101110... (cf. Example 1.1). In a sense, such an example is pretty dis- 
satisfying, for it is difficult to manipulate this number (i.e., it is difficult to make 
calculations with it). We will remedy this situation in what follows. 

From an arithmetic point of view, one great advantage of the set of reals, in 
comparison to the set of rationals, is the possibility of making root extractions of 
positive real numbers. More precisely, given a real number x > 0 and a natural 
number n, it is possible to prove (and we will do so, in two different ways, in 
sections 7.1 and 8.3) that there exists a unique positive real number y such that 
y” = x. From now on, we shall call this real number y the n—th root of x, and write 
y = %/x; the natural n is the index of the root. In short, 


y= te oxay". 


Indices n = 2 and n = 3 occur so frequently that deserve special names and 
notations. When n = 2 (and x > 0), we write simply ./x, instead of 2/x, and 
say that ./x is the square root of x; when n = 3 (and x > 0), we say that 3/x is the 
cubic root of x. 

The argument in the next example allows us to heuristically understand why roots 
of positive reals do exist. 


Example 1.6 By definition, we have V2 = 2. Since 12 < 2 < 2?, the result 
of Problem 11, page 11, gives 1 < /2 < 2. Now, since 1.47 < 2 < 1.5?, it 
follows once again from that problem that 1.4 < /2 < 1.5. Analogously, since 
1.412 <2 < 1.422, we have 1.41 < J2 < 1.42. Continuing this way, we obtain 
a unique list (aj = 4,a. = 1,a3 = 4,...) of digits, such that l.ajaz...d,—1dy < 
J/2 < l.ajd... Gn—1, for all n > 1, where a’, = da, + 1. Thus, we have no option 
but to conclude that /2 = l.ajana3... = 1.414... 


At this point, we urge the reader to at least read the statements of Problems | 
and 2, page 14, to get an idea on how to (partially) extend the concept of n—th root 
to negative reals, as well as to take a look at the main properties of root extraction. 

We now turn to powers of natural numbers. A perfect square is a natural number 
which can be written in the form m7, for some m € N; hence, the perfect squares 
are the natural numbers 17 = 1, 2? = 4, 37 = 9, 4? = 16 etc. A perfect cube is a 
natural number which can be written in the form m?, for some m € N; the perfect 
cubes are, then, the naturals 17 = 1, 2? = 4, 3° = 27, 4° = 64 etc. More generally, 
a natural n is a perfect power if there exist natural numbers n and k, with k > 1, 
such that n = m*. If this is the case, we say that n is a k-th perfect power, i.c., is 
equal to one of the natural numbers 1*, 2", 3, 4* etc. Equivalently, to say that n ¢ N 
is a k-th perfect power is the same as to say that its kK—th root, 4/n, is a natural 
number. 

The following proposition, which by now we take for granted, gives an infinite 
supply of examples of irrational numbers, which, as we will see in a moment, are 
more or less easy to handle. For a proof of it, we refer the reader to Chap. 6 of [5]. 
(Nevertheless, see Problems 7 and 8.) 
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Proposition 1.7 Given natural numbers n and k, with k > 1, either n is a k-th 
perfect power or &/n is an irrational number. 


According to the above proposition, numbers like /2, 73, \/10 etc are all 
examples of irrationals (for 2 is not a perfect square, 3 is not a perfect cube and 
10 is not a perfect 5—th power). 

Now we can, at least formally (i.e., without worrying with approximations), 
operate with several irrational numbers. Let us see an example where we (indirectly) 
apply the last proposition to explain why a certain real number is irrational. 


Example 1.8 The number J/2 + V3 is irrational. In fact, if we let r = J2 + J3, 
there are two possibilities: r € Q or r ¢ Q. Suppose, for the sake of contradiction, 
that r € Q. Then, since the set of rationals is closed with respect to multiplication, 
we would have 7? € Q. On the other hand, the distributivity of multiplication with 
respect to addition gives 


P = (724 V3)(V2 + V3) 
= V2(/2 + V3) + V3(V/2 + V3) 
= (2+ V6) + (V6 +3) 
=54+2V6 


so that /6 = a Therefore, 6 would be the quotient of the rational numbers 
r? —5 and 2 and, as such, /6 would be itself a rational number. This contradicts the 
result of Proposition 1.7, so that r € Q. 


In order to finish our discussion on rational and irrational numbers, note that the 
set of irrationals is not closed with respect to the ordinary arithmetic operations. 
For instance, for a given irrational number r, although —r is also irrational, we have 
r+ (—r) = 0, which is a rational. On the other hand, if we set r = J/2, we get 
r-r =r = 2, which is also a rational. Finally, if r 4 0, then the quotient of r by 
itself is equal to 1, again a rational number. 


Problems: Section 1.3 


1. * Given x < 0 real and n € N odd, let y = —¥/—x. Prove that y” = x (so that the 
real number y is also called the n—th root of x). 
2. * Given m,n € N and x, y > 0, prove that: 


(a) yay = ay. 
(b) w/e = Vea. 
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Then, extend the above properties to all nonzero reals x and y, provided m and n 
are odd naturals. 

3. * Let a and b be rational numbers, and let r be an irrational one. If a + br = 0, 
prove thata = b= 0. 

4. Let a, b, c and d be rational numbers, and let r be an irrational one. If a + br = 
c + dr, prove that a = c and b = d. 

5. Let r be a positive real and let k be an integer greater than 1. If r is irrational, 
prove that the numbers + and /r are also irrational. 

6. (Canada) Let a, b and c be rational numbers, such that a + b/2 +cJ/3 = 0. 
Prove that a, b and c are all equal to zero. 

7. * Assuming the validity of the Fundamental Theorem of Arithmetic (according 
to the last paragraph of the introduction to this chapter), prove that /2 is an 
irrational number. 

8. Let p be a prime number, and k > 1 be natural. Prove that ¥/p is irrational. 


1.4 The Geometric Representation 


A quite useful way of thinking geometrically on the set of rational numbers is the 
following: we take a line r, and mark on it a point O; then, we choose one of the half- 
lines that O determines on r, call it positive (the other being called negative) and set 
a line segment £ as unit of measure. Then, we associate to each rational number a 
point of 7, in the following way: first, we associate 0 to the point O; then (according 
to Figure 1.1), given a rational number ¢, with a, b € N, we mark, starting from O 
and on the positive half-line, a line segment OA of length aé (i.e., OA is obtained by 
juxtaposition of a segments equal to £). If b = 1, we associate ¢ = a to the point A. 
If b > 1, we divide OA into b equal line segments, by plotting b — 1 points on OA; 
from these b — 1 points, we let B denote the one closest to O, and associate ; to the 
point B. It is not difficult to show (cf. Problem 1) that this geometric construction is 
consistent, in the sense that, by changing ¢ for another equivalent fraction, we get 
the same point B on r. Moreover, an analogous construction can obviously be made 
for the negative rationals, marked on the negative half-line. 

By proceeding as described in the previous paragraph, it happens that there are 
lots of points on r which are not associated to any rational number. In order to give a 
simple example, let A be the point associated to the number |, and construct a square 


O B A r 


Fig. 1.1 plotting rationals on a real line. 
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O AE r 


Fig. 1.2 a point that doesn’t represent any rational number. 


OABC, as shown in Figure 1.2. By using a compass, mark on the positive half-line a 
point E such that OF = OB. Since OA = 1, it follows from Pythagoras’ Theorem!? 
that OF = OB = V2. However, since J/2 is irrational (cf. Proposition 1.7 or 
Problem 7, page 15), we conclude that E is not associated to any rational number. 

At this point, two natural questions pose themselves: is it possible to mark on r 
all of the real numbers? If we assume that the answer to the previous question is 
yes, then, after we mark all reals on r, will there be unmarked points on r? One of 
the axioms of Plane Euclidean Geometry!!, stated below, assures that the answers 
to these two questions are, respectively, yes and no. 


Axiom 1.9 There is a one-to-one correspondence between the points on an 
Euclidean line r onto the set of real numbers, which is completely determined 
by the following choices: 


(a) A point O on r, to represent the real number 0. 

(b) A half-line, among those that O determines on r, where we mark the positive 
reals. 

(c) A point on the half-line of item (b), to represent the number 1. 


If we fix choices on a line r as specified by the above axiom, we say that r is the 
real line (cf. Figure 1.3). 
For further use, we need the following definition. 


Definition 1.10 For given real numbers a < b, we set!*: 


(i) [a,b] = {x € Rs a <x < dD}. 


°Pythagoras of Samos was one of the greatest mathematicians of classical antiquity. The theorem 
that bears his name was already known to babylonians, at least two thousand years before he was 
born; nevertheless, Pythagoras was the first one to prove it. It is also attributed to him the first proof 
of the irrationality of J/2. 


'0We recall that Pythagoras’ Theorem, one of the most celebrated (and important) results of Plane 
Euclidean Geometry, says that, in every right triangle, the square of the length of the hypotenuse 
equals the sum of the squares of the lengths of the legs. For two different proofs of it, see chapters 4 
and 5 of [4]. 


‘For an axiomatic construction of plane Euclidean Geometry, we refer the reader to [16]. 


We call the reader’s attention to the less common notations [a, b[ instead of [a, b), ]a, b] instead 
of (a, b], Ja, b[ instead of (a, b), [a, +00] instead of [a, +-00) and ] — 00, al instead of (—00, a). 
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Gili) [a,b) = {xe Ria<x< bd}. 
(iii) (a,b] = {x € R;a<x<D}. 
(iv) (a,b) = {x Ee R;a<x < Dd}. 
(v) [a, +00) = {x € Rj a < x}. 
(vi) (a, +oo) = {xe Ria <x}. 
(vii) (—oo, a] = {x € R; x < a}. 
(viii) (—oo, a) = {x € R; x < a}. 


An interval in R is the set R itself, or a subset of R of any of the seven types 
above. Observe that, in the real line, an interval corresponds to a line segment 
(perhaps with the exclusion of one or both of its end points), to a half-line (perhaps 
with the exclusion of its real end point), or even to the whole real line. 


Remarks 1.11 


i. We stress that the symbols +-oo and —oo (one respectively reads plus infinite 
and minus infinite) do not represent real numbers. They merely serve to point out 
that, in each of the items (v), (vi), (vil), (viii) above, the corresponding intervals 
do contain all reals greater than or equal to (resp. greater than), or less than or 
equal to (resp. less than) a. 

ii. According to the previous definition, we shall denote R = (—oo, +00). 


Given real numbers a < b, we say that a and b are the endpoints and that b—a is 
the length of each one of the intervals of items (i) to (iv), in the previous definition. 
In this case, we also say that those intervals have finite length. Analogously, the 
real number a is the (only) endpoint of each of the intervals of items (v) to (viii), 
which have infinite (1.e., not finite) lengths. An interval in R is bounded provided 
it has finite length; otherwise, the interval is said to be unbounded. In particular, 
the bounded intervals of R are precisely those of items (i) to (iv), in the previous 
definition. 

If a bounded interval has endpoints a < b, we shall say that it is closed, closed 
on the left, closed on the right or open when such an interval is respectively equal 
to [a, b], [a, b), (a, b] or (a, b). Alternatively, we say that [a, b) is open on the right 
and (a, b] is open on the left. Finally, infinite intervals are named accordingly, by 
using obvious extensions of the above terminologies. Figure 1.3 shows the interval 
[a,b), 


0 a b R 


Fig. 1.3. the open on the right interval [a, b). 
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Problems: Section 1.4 


1. * With respect to the geometric interpretation of rational numbers, discussed at 

section 1.4, let the fractions $ and S be given, where a,b,c,d € N. If $ — ee 
explain why the construction given in the text associates both these fractions to 
the same point of the real line. 


Chapter 2 
Algebraic Identities, Equations and Systems 


The rest of this volume, up to Chap.5, approaches and develops several tools 
necessary for an adequate presentation of the material in volumes 2 and 3. We 
start by studying, in this chapter, some important algebraic identities, equations and 
systems of equations. 


2.1 Algebraic Identities 


Through the rest of these notes, we refer to a varying real number as a real 
variable.' In general, real variables will be denoted by lower case Latin letters, 
for example a, b, c, x, y, z etc (an important exception to this usage is mentioned in 
the next paragraph). 

An algebraic expression, or simply an expression, is a real number formed 
from a finite number of real variables, possibly with the aid of one or more 
algebraic operations, i.c., additions, subtractions, multiplications, divisions, power 
computations and root extractions (whenever the results of these operations make 
sense in R). In particular, every real variable can be seen as an algebraic expression. 
For another example, 


a: = ye 
x+ J¥—*2 + 3 x2yz3 — x4 


YZ 


is an algebraic expression which makes sense for all reals x, y, z, such that y > 0 
and z # 0 (recall that Problem 1, page 14, assures that we can extract roots of odd 
index of any real number). We shall denote algebraic expressions by upper case latin 
letters, as E, F etc. 


‘In [5], we shall have the opportunity to consider complex variables. 
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We say that an algebraic expression E is a monomial if E is a product of 
a given nonzero real number by powers of its variables, each of which having 
nonnegative integer exponents. Thus, the monomials in the real variables x and y 
are the expressions of the form ax“y', where a 4 0 is a given real number and 
k,l > O are nonnegative integers (here, we adopt the convention that x* = 1 
whenever k = 0, and y’ = 1 whenever / = 0—see Problem 4, page 11). For an 
arbitrary monomial, the given nonzero real number that plays the role of a in ax*y! 
is called its coefficient. Hence, the monomials in x, y with coefficient 2 are those of 
one of the forms 


2, 2x 2y, 2x, 2xy, 2y’, 2x3, 2x’y, oxy’, ay" etc. 


A polynomial expression or simply a polynomial is (an expression that is) a 
finite sum of monomials, as, for instance, 


2+ 3xy- V5xyz. 


The coefficients of a polynomial are the coefficients of its monomials. 

Let E and F be algebraic expressions. We say that equality E = F is an algebraic 
identity provided it is true for all possible values of the involved real variables. 
In order to give a relevant example, let us consider the algebraic expression E = 
(x + y)”. The elementary properties of the operations of addition and multiplication 
of real numbers (i.e., commutativity and associativity of addition and multiplication, 
as well as distributivity of multiplication with respect to addition) give 


E=(«x+y)at+y) =x(x+y)+y@+t+y) 
= (x + xy) + Ox+y’) 
=x 4 2xy+y’, 


for all values of the real variables x and y. Therefore, by setting F = x? + 2xy + y’, 
we obtain the algebraic identity E = F, i.e., 


(x+y) =x + 2xy+y’, (2.1) 


to which we refer, from now on, as the formula for the square of a sum of two real 
numbers. 

The following proposition collects some important algebraic identities, which the 
reader must keep for future use. 


Proposition 2.1 For all x,y,z € R, we have: 
(a) x —y? = (x—y)(x+y). 

(b) (xty)? =x* +2xy4+- y’. 

(c) P+y =(+y)@ Fayty’). 
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(d) xtyp=xrHPty + 3x y). 
(ce) &@+yt+ Var +y +2 + Ay + 2xz + yz. 


Proof We let the proofs of items (a)—(c) as exercises (see Problem 1), observing that 
the identity of item (b), with the + sign, was established in (2.1). In item (d), let us 
prove the identity for (x + y)*; that for (x — y)? is totally analogous: by invoking the 
distributivity of the multiplication with respect to addition, as well as identity (2.1), 
we get 


@+y)? =@+yW@+yW =xe+y +y@e4+y) 
= x(x? + Ixy +’) + y@? + 2xy +’) 
= (x? + 2x’y + xy’) + (@’y + 2xy’ + y’) 
= x3 + y? + 3x’y + 3xy’ 
=x + y + 3xy(x+ y). 


In order to get the result of item (e), we apply that of item (b), with x + y in the 
place of x and z in the place of y: 


@+yt+2 =[%+y 427? 
=(t+y’*+2e04+yzt+2 
= (7° + xy +y) + 2ez+y) +2 
Hr ty +2 + Qny + Qxz + 2yz. 


oO 


The reader has certainly noticed that, in the previous proposition, one either has: 
(i) an identity of the form EF = F, where E is a product of (at least two) polynomials 
and F is the sum of monomials we get from expanding the products in E (this is the 
case of the identities of items (b), (d) and (e)); or else (ii) an identity of the form 
E = F, where E is a polynomial and F is a product of (at least two) polynomials (as 
in items (a) and (c) of the previous proposition). In case (ii), we shall sometimes say 
that F is a factorisation of £, or that it is obtained by factoring out expression E. 

The coming examples will give us an idea on how to apply the identities collected 
in the previous proposition to solve several interesting problems. 


Example 2.2. Let x, y, z be real numbers, not all zero, such that x+-y+z = 0. Explain 
why xy + xz + yz 4 0 and, then, compute all possible values of the expression 
ey Fe 
xy tyz+ zx 
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Solution Squaring both sides of x + y +z = 0, it follows from item (e) of 
Proposition 2.1 that x* + y? + 27 + 2(xy + xz + yz) = 0. Ifxy + xz + yz = 0, we 
would have x7 +y?+z* = 0, and a simple extension of Corollary 1.5 (cf. Problem 2, 
page 10) would give us x = 0, y = 0 and z = 0, contradicting our hypotheses. 
Therefore, xy+xz+ yz # 0, and it follows from x* + y? +z? = —2(xy+xz+-yz) that 
eP+yte 
XY + YZ 2x 
oO 


Our next example shows how to use the algebraic identities we know so far to 
prove inequalities.” 


Example 2.3 (Poland) For given positive real numbers a and b, prove that 4(a? + 
b’) > (a+b). 


Proof By expanding the right hand side with the aid of item (d) of Proposition 2.1, 
it is immediate to see that the inequality we want to prove is equivalent to a? + b? > 
a*b + ab’. It now suffices to see that 


a+b) —a@b—ab? =a —a’b+b —ab’ = a (a—b) -— b’(a—b) 
= (a — b’)(a—b) = (a+ b)(a— b)(a— b) 
= (a+ b)(a—b)* = 0, 


fora + b > Oand (a—b)* > 0. oO 
We now generalize Example 1.8. 


Example 2.4 (Austria) Let a and b be positive rationals, such that /ab is irrational. 
Prove that ./a + »/b is also irrational. 


Proof By contraposition, suppose that r = ,/a+ <b were a rational number. Then, 


r? = a+b+2J/ab would also be rational. However, in such a case, we would have 


r—a—b 


Vab = 2 


’ 


which would be a rational number too, for, in the right hand side of the above 
equality, both the numerator and the denominator are rational numbers. Oo 


Example 2.5 (Canada) For each natural number n, prove that 
n(n + 1)(n + 2)(n + 3) 


is never a perfect square. 


We will undertake a thorough discussion of inequalities in Chap. 5 and Sects. 9.7 and 10.8. 
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Proof Letting p = n(n + 1)(n + 2)(n + 3), we have 


p = (n+ 3)]l(n+ I) + 2)) 
= (n* + 3n)[(n* + 3n) + 2] 
= (n? + 3n)? + 2(n? + 3n) 
= [(n? + 3n)* + 2(n? + 3n) + 1]-1 
= [(n? + 3n) + 1]? -1. 
If we set m = n? + 3n + 1, we have m > 1 and, hence, 
p=m—1>m—-2m+1=(m-1)’. 
Therefore, p is situated between the consecutive perfect squares (m— 1)? and m?, so 
that it cannot be, itself, a perfect square. Oo 
Apart from the algebraic identities collected in Proposition 2.1, another fre- 
quently useful one is that given by the equality 


(x—y)(@—z) =x’ — (yt 2x4 yz. (2.2) 


Observe that, at the right hand side of the above expression, both the sum S$ = y + z 
and the product P = yz of y and z do appear. An expression of the form x? — Sx + P, 
where S and P represent the sum and the product of two numbers or expressions, is 
called a second degree trinomial in x. Hence, writing (2.2) backwards, we can also 
see it as giving a factorisation for the second degree trinomial x* — Sx + P, where 
S=y+zandP = yz: 


x? —Sx+P=(x-—y)(x-2). (2.3) 


The above factorisation is sometimes called Viéte’s formula, in honor of the French 
mathematician Francois Viéte.° 
The following example shows us how to apply Viéte’s formula. 


Example 2.6 (Soviet Union) Let a, b and c be pairwise distinct real numbers. Show 
that the number 


a(c—b)+ b’(a—c) + c’(b—a) 


is always different from zero. 


Francois Viéte, French mathematician of the XVI century. By his pioneerism in the usage of letters 
to represent variables, Viéte is sometimes called the father of modern Algebra. 
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Proof Letting S denote the given number, we have 


S=a@(c—b)+bVa—-b’c+Cb—ca 

= a’(c —b) + (b’'a— ca) + (b—b’c) 
a’(c —b) + a(b +. c)(b—c) + bc(c —b) 
(c — b)[a? — a(b +c) + be] 
= (c—b)(a—b)(a—c), 


II 


II 


where we used (2.3) in the last equality. Now, it follows from a # b, b # c and 
c #athata—b,c—b,a—c #0,so that S £ 0. oO 


A useful variant of Viéte’s formula is the factorisation for the expression x7 + 
Sx + P, where, as before, S = y + zand P = yz: 


e+ Sxt+P=(x+y)(«4+2). (2.4) 


If we change S, y and z in (2.3) respectively by —S, —y and —z, we immediately see 
that (2.4) is indeed equivalent to that factorisation. 

The next example uses (2.4) to get yet another algebraic identity, which will be 
further applied in a number of places, both in this volume as well as in [4] and [5]. 


Example 2.7 For all x,y,z € R, we have 
@tytPFaerty+24+3e+ye+Dyt+. (2.5) 


Proof Applying item (d) of Proposition 2.1 twice, first with x + y in place of x and 
z in place of y, we successively get 


atytz =[~%+y +2 
=(xtyPt243@4+yex+y 47 
Harty t3xyety) +24 3e@4+ye+y)zt2) 
Har+yt2t3a+y byt &+y)z4+ 2] 
HaP+y4343e+yWtde4+2, 


where, in the last equality, we have used the variant (2.4) of Viéte’s formula. oO 


Problems: Section 2.1 


1. * Prove the other items of Proposition 2.1. 


2.1 


2; 


11. 
12. 
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Ifm-+n-+p = 6, mnp = 2 and mn + mp + np = 11, compute all possible 


Mya BD 
values of a + a +=. 


. Let a and b be nonzero real numbers, such that a # b, 1. If Gy = Gea 


1 


compute all possible values of + Bae 


. Given positive real numbers x and y, simplify the expression 


-(5)" 


(Ja fy? + 2/07" 


. For x,y,z 4 0, such that y + z # 0, simplify the expression 


(x3 +y ao 2) _ Ge -y — 2) 
ytz : 


. Let a and b be real numbers such that ab = 1 anda # b. Simplify the 


expression 


. Let x and y be natural numbers such that x? + 361 = y?. Find all possible 


values of x. 


. Real numbers a and b are such that a+ b = mand ab = n. Compute the value 
of a* + b* in terms of m and n. P 
. Ifa’ + b* = 1, find all possible values of aan : 


. (EKMC) Let a, b, c and d be real numbers such that a7+b* = 1 andc?+d?=1. 


Ifact+bd = eee compute the value of ad—bc, provided it is a positive number. 
(Brazil) Find all natural numbers x and y such that x + y + xy = 120. 

* Given positive distinct real numbers x and y, prove that the following 
rationalisations* are valid: 


@ zh = 


1 = 
0) yy = SS 


6 —————— 
( ) SOx 3fat a y2 xary 


4In an informal way, one can think of a rationalisation as a way of clearing roots from 
denominators. 
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14. 
15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 
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. * For a natural number n > 1, show that 


2(vn+1- vn) < 5 <2(¥a—va=T). 


Rationalise 


L 
272473" 


= =: More precisely, obtain integers a, b, c, d, e, f and g such 


J2+ 73 


Rationalise 
that 


aR = “((av3 +b) + (cV¥2 4+ d)V3 + (eV2 + f)V9]. 


Let x, y and z be nonzero real numbers, such that x + y + z = 0. Explain why 
the sum of any two of them is also nonzero, and compute all possible values 
of each of the following expressions: 

x2 y z 
@ 5 at eT 
x y zg 
(b) (y+z)3 + (x+z)3 + (xt+y)3° 
Let a and b be distinct integers. Find, in terms of a and b, the quotient of the 
division of a® — b™ by (a + b)(a? + b?)(a* + b*)(a® + b8)(a!® + b!). 
* Given n > | integer and a, b € R, prove that the following factorizations are 
valid: 


(a) a’ —-b’= (a- b)(a"™! ae a’ 2b a a’ 32 Hever ers b’-!), 
(b) a? +b" = (a+b)(a"“! —a"2b +a" 32 —--- +b"), provided n is odd. 


Write x*+ + 4y* as a product of two non constant polynomials in x and y, both 
having integer coefficients. 

(Canada) Let a, b,c € Z. Prove that 6 divides a+ b+ c if and only if 6 divides 
e+b4+c’. 

(Canada) If a, b and c are real numbers for which a + b + c = 0, show that 
a+b4+c =3abe. 

Prove the double radical formula, also known as Bhaskara’s formula?: for all 
positive real numbers a and J, such that a2 > b, one has 


[imo ah =b 
ptilh = om ——. 


5In honor of the Indian mathematician of the XII century Bhaskara II, also known as 
Bhaskaracharya (Bhaskara, the professor). The idea behind Bhaskara’s formula is that, if a and 
b are naturals for which a? — b is a perfect square, then his formula provides a simpler expression 


for 


Vax Vb. 
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23. Show that there do not exist nonzero real numbers x, y and z such that x + y + 
z# Oand 


1 1 1 1 


x+y+zZ "5 yo 
24. (Soviet Union) Let a, b and c be pairwise distinct rationals. Prove that 


1 1 1 
oP * ap * Go 
is the square of a rational. 
25. (TT) Let a, b and c be distinct rationals. If 3/a + Jb € Q, prove that ¥/a, 
/b EQ. 
26. (TT) Let a, b, c, d, e and f be real numbers such thata+b+c+d+e+f=0 


anda? +b3>+c3+d>+e3 +f? = 0. If no two of them are opposite to each 
other, prove that 


(a+c)\(a+d)(at+ej(at+f) = (b+ c\(b+a(b+eyb+f). 


27. (Poland) For positive integers a < b, do the following items: 


(a) Show that b? < b? + 6ab +1 < (b+ 2)°. 
(b) Find all such a and b for which both a? + 6ab + 1 and b? + 6ab + 1 are 
perfect cubes. 


2.2 The Modulus of a Real Number 


We start this section by recalling the definition of modulus of a real number, a 
concept which will be important in a number of places hereafter. 


Definition 2.8 For x € R, the modulus of x, denoted |x|, is defined as 


ic x, ifx>0 
~ (=x, ifx<0° 
As an example, since —5 < 0, we have | — 5| = —(—5) = 5; analogously, 


| — /3| = —(—V3) = V3 etc. More generally, an immediate consequence of the 
definition is that |x| > 0 for all x € R, with equality if and only if x = 0. Moreover, 
one always has 


x < |x| =|—41, 
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with equality if and only if x > 0. Note also that 
|x| = Vx? = max {x, —x}. (2.6) 
The simplest modular equation is the equation 
|x — al = b, 
where a and b are given real numbers. Since |x — a] > 0, such an equation does 
not admit roots when b < 0. On the other hand, when b > 0, it follows from the 
definition of modulus that one must have either x — a = b orx—a = —b, from 
where we get the roots 


x=at+b,a—b. 


The coming example shows how to solve a more elaborate equation in a single 
variable, involving the concept of modulus of a real number. 
Example 2.9 Solve equation |x + 1| + |x —2| + |x—5| = 7. 


Solution First of all, note that 


x+1,ifx>-l 
1 — 3 —_ 
ered eases 
=oS x—2,ifx>2 
~ lax+2,ifx<2 
and 
ease x—5,ifx>5 
~ (-x+5,ifx<5/ 


Now, the conjunction of the conditions x < —l orx > —l,x < 2orx > 2, 
x < 5orx > 5 partitions the real line into the intervals (—oo, —1), [—1, 2), [2, 5) 
e [5, +00). Hence, in order to simplify the left hand side of the given equation, we 
separately consider x as varying in each one of these intervals. We thus obtain 


—3x +6, if x <—-l 
—x+8,if -—l<x<2 
x+4,if2<x<5 

3x—-6,ifx>5 


Ix + 1] + |x-—2| + |x-—5| = 
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Fig. 2.1. Modulus of a real | x| 
number 


Finally, note that 


© -3x+6=79x= —4; however, since the condition —4 < —1 is not satisfied, 
there are no roots in this case. 

e —-x+8=7< x = 1; since the condition —1 < 1 < 21s satisfied, x = lisa 
root of the equation. 

e x+4=7< x = 3; since the condition 2 < 3 < 5 is satisfied, x = 3 is alsoa 
root of the equation. 

© 3x-6=78x= 3; since the condition e > 5 is not satisfied, there are no 
roots in this case. 


Therefore, the solution set of the given equation is S = {1, 3}. oO 


Back to the study of the properties of modulus, let us represent the real numbers 
as points in the real line. It is easy to see that |x| is simply the distance from (the 
point that represents) x to (the one representing) 0 (cf. Fig. 2.1). More generally, 
given x, y € R, we can look at |x — y| as the distance from the points x and y in the 
real line. In fact, since |x — y| = |y — x|, we can suppose that x < y. Then, 


|x — y| = y—x = distance from x to y in the real line. 


In the above reasoning, if we do not wish to consider which of x and y is the 
greatest one, we can write 


|x — y| = max{x, y} — min{x, y}, (2.7) 


for 
{x, y} = {max{x, y}, min{x, y}}. (2.8) 


There simple remarks suffice to consider the following hard example. 


Example 2.10 (Yugoslavia) Letn € N and M = {1,2,3,...,2n}. Also, let M, = 
{d1,@o,...,@,} and My = {b,bo,...,b,} be subsets of M such that a; < a2 < 
+++ <a,andb; > by >--- > by. If M) UM = M and M; MN M2 = 9, prove that 


lay — by | + laa — ba| + +++ + [an — Dal =n. 


Proof It follows from (2.8) that 


n n 


|_}{max{a;, bj}, min{a;, bi}} = (_J{ai, bi} = {1,2,3,..., 2n}. 


i=1 i=1 
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Also, (2.7) gives 


lay —b\| + |a2 — bp| Si lan — Dy| = 
= (max{ay, b}} + max{ap, bo} +--+ + max{dy, by }) 
— (min{a,,b,} + min{a2, by} + +--+ min{ay, dy}). 


On the other hand, given integers 1 < k,/ <n, with k 4 I, we have 


k>l> max{ax, by} > a> a= min{a;, b;} 


and 
k <1=> max{ag, by} > by > by) = min{aj, bj}. 
Therefore, 
{max{a;, b;}, max{d2, bz},..., max{dy, b,}} = {n+ 1,n+2,...,2n} 
and 


{min{a,, bj}, min{az, bz},..., min{a,, b,}} = {1,2,...,n}. 
Finally, the above relations give 
lay — by| + |az — ba| + +++ + lan — Dal = 
=((n+ 1) 4+ (n+ 2) 4+-+-+2n)—-(14+24+---+n) 
=n +(Lt2+---tn)—-(+24+---¢n) =r’. 


oO 


We continue our study of the concept of modulus with the following important 
result, which is known in mathematical literature as the triangle inequality.° 


Proposition 2.11 For all real numbers a and b, we have 
la + b| < |a| + |B]. (2.9) 


Moreover, if a,b 4 0, then equality holds if and only if a and b have the same sign. 


At the end of Sect.5.2, we will give an explanation of why (2.9), as well as the coming 
inequality (4.6), are called triangle inequality. 
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Proof Since |a + b| and |a| + |b| are both nonnegative, we have 


Ja +b] < lal + |b] la +d)? < (al + [d1)° 
€ (at by’ < |al’ + |b)? + 2\ab| 
<> 2ab < 2\abI, 
which is clearly true. From the computations above it also follows that |a + b| = 


|a| + |b| if and only if ab = |ab]; in turn, this happens if and only if ab > 0. Finally, 
if a,b # 0, then we have the equality if and only if ab > 0. oO 


Corollary 2.12 For all real numbers a and b, we have 
lla] — |b|| < ja— 9B]. 


Moreover, if a,b 4 0, then the equality holds if and only if a and b have the same 
sign. 


Proof Applying the triangle inequality to a — b in place of a, we get 
la] = |(a—b) + b| < |a—b] + [DI 


and, hence, |a| — |b| < |a — b|. Repeating the above argument with the roles of a 
and b interchanged, it follows that |b] — |a] < |a— DJ. 
Now, since |a — b| > |a| — |b|, |b| — |a|, we get 


|a— b| = max{|a| — |b], |b| — lal} = lla — [Al 


where we used (2.6) in the last equality. 

Equality happens if and only if we have equality in at least one of the triangular 
inequalities |a| < Ja — b| + |b| or |b] < |b — | + la]. If a,b ¥ 0 and, say, equality 
holds in the first one of theses inequalities, i.e., if Ja] = |a — b| + |b], then the 
condition for equality in Proposition 2.11 assures that we must have (a — b)b > 0, 
or, which is the same ab > b?. In particular, we must have ab > 0. 

Conversely, suppose that ab > 0, and let us show that equality holds. There are 
two possibilities: a,b > 0 or a,b < 0. Suppose that a,b < 0 (the other case can 
be treated in a similar way). Then, |a| = —a and |b] = —B, so that ||a| — |b|| = 
|(—a) — (—b)| = |b—al = |a—). qo 


Given real numbers a, b and c and applying triangle inequality twice, we get 
lJa+b+cl <|a+b| + |e] < lal + |b] + Icl, (2.10) 
Hence, we have the inequality 


la+b+c| < jal + [| + Ic, (2.11) 
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which is the analogous of (2.9) for three real numbers, instead of two. Therefore, 
we shall also refer to this last inequality as the triangle inequality. 

If a,b,c # 0 and we have equality in (2.11), then we should also have equality 
in all inequalities in (2.10). In particular, we have |a + b| < |a| + |b], and it follows 
from Proposition 2.11 that a and b have equal signs. Since we can also reach (2.11) 
by writing 


lat+b+e| < |a|+|b+c| < lal + |b] + Ic, 


we conclude, analogously, that b and c should also have equal signs. 
Conversely, if a, b and c all have equal signs, say a, b,c < 0 (the case a,b,c > 0 
is completely analogous), then a + b+ c < 0, so that 


la+b+c|=—(at+b+c) = (—a) + (—b) + (-c) = [al + |B] + Icl. 


Hence, we have just shown that equality holds in (2.11) if and only if a, b and c 
have equal signs. 

As we shall see in Sect. 4.1 (cf. Problem 7, page 96), inequalities (2.9) and (2.10) 
can be easily generalized for n real numbers. For the time being, we end this section 
with the following 


Example 2.13 Prove that, for every x € R, we have 
|x — 1] + jn —2| + |x— 3] +--+ + |[x— 10] > 25. 
Proof It follows from the triangle inequality that 
Ix —a| + |x —b| = |x-—al + |b-—2x| = |&-—a) + (6-x)| = |b- al. 
Hence, grouping the summands at the left hand side in pairs, we get, 


Ix — 1] + |x— 10] > [10-1] = 9; 
|x — 2] + |x—9| = |9-2| =7; 
|x — 3] + |x—8] > [8-3] =5: 
|x — 4] + |x—7| = 17-4] = 3; 
Ix—5| + |x—6] > [6-5] = 1. 


IV 


Adding these inequalities, we obtain that of the statement. oO 
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Problems: Section 2.2 


1. * Given real numbers a and b, show that 


0, ifb<0 
{x € R; |x-—a| < b} = 4 {a}, if b=0 ; 
(a—b,a+b), ifb>0 


Do the same for |x — a] < b, |x —a| > band |x —al > b. 
2. * Prove that, for all x, y € R, one has |xy| = |x| - ly]. 
3. Solve, for x € R, the following equations: 


(a) |x] =x-6. 
(b) jx +1] + |x—2| + |x-—5| = 4. 


I ion BL = bet 
4. Solve, for x € R \ {0, 1}, equation 4 = <—-. 


5. Let a, b and c be given real numbers, with a < b. Discuss, in terms of a, b and c, 
the number of solutions of the equation 


Ix —al + |x-—bl=c. 


6. (Mexico) Let r be a nonnegative rational number. Prove that 


r+1 


2 1 
is - v3 < 5Ir- V3. 


(This inequality shows that the rational number > 2 approximates /2 twice as 
better as r does it.) 
7. Prove that: 


(a) If0<x<y, then + < 75- 


b atb 
(b) Ifa,b€R, then HO + Bo > ee. 


+a 


8. Letn > | be an integer. Prove that 
Jx— 1] + |x—2| +--+ + |x—2n| > rr? 


for every real x, with equality for infinitely many values of x. 


2.3 A First Look at Polynomial Equations 


In this section we study some particular types of polynomial equations, postponing 
a much deeper look to [5]. 
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In general, a polynomial equation of degree n is an equation of the form 
ig” + a) eo ax tay = 0, (2.12) 


where n > | is an integer and ao, aj,..., dy are given real numbers,’ with ay, #0. 

The simplest kind of such an equation is the first degree equation ax + b = 0, 
where a and b are given real numbers and a ¥ 0. As the reader certainly knows, we 
have 


b 
ax+b=0Sax=-bSx=--, 
a 


so that 4 is its only root. 
The second simplest kind of polynomial equation is the second degree equation 


ax’ +bx+c=0, (2.13) 


where a, b and c are given real numbers, with a # 0. For reasons that will soon 
be clear, the left hand side of (2.13) is also known as the second degree trinomial 
associated to Eq. (2.13). 

In order to solve (2.13), we let A (one reads delta) denote the real number 


A= b’* —4ac, 
and call it the discriminant of the equation (or of the associated trinomial). As we 


shall see in a moment, the sign of A discriminates whether or not the equation has 
real roots. To this end, we need the following auxiliary result. 


Lemma 2.14 Given a,b,c € R, with a # 0, one has 


24 by + = by A (2.14) 
ax _ : : 
IX Cc a x 5) 


This algebraic identity is called the canonical form of the second degree trinomial 
ax? + bxt+c. 


Here, we are using the concept of a sequence of real numbers. For further details in this respect, 
we refer the reader to Chap. 3. 
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Proof It suffices to see that 


5 (e b “) 
ax’+bx+e=al(x+-x4+- 
a a 


oO 


Remark 2.15 The idea of adding and subtracting a certain summand out of a given 
algebraic expression in order to complete a square, as was done right after the 
second equality in the proof of the previous lemma, is very important and should 
be learned as a kind of algebraic trick that will be useful in a number of places, 
here as well as in [4] and [5]. Later in this chapter, we shall see other interesting 
applications of such a technique. 


Proposition 2.16 Leta, b and c be given real numbers, with a # 0. 


(a) The equation ax? + bx + c = 0 has real roots if and only if A > 0. Moreover, 
if this is so, then its roots are SA and soi 
(b) If A = 0, then the sum S and the product P of the roots of ax? + bx +c = 0 are 


given by S = —4 and P = <. 
Proof 


(a) It follows from (2.14) that 


A 
24+ bx+c=098 wee =-—> (2.15) 
sa 2a 4a 

Since (x + by? > 0 for all x € R, if the equation has real roots, then one must 


have A > 0. In this case, it transpires from (2.15) that x + 2 =+ ae and item 
(a) follows. 
(b) It suffices to compute 


—b-VJV/A —-b+WvWA b 
i 
2a 2a a 
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and 


(a fs 5) _ (—b)?-—A _¢ 


2a 2a 4a? a 


Remarks 2.17 


i. When A > 0, formulae Seva for the roots of ax? + bx +c = 0 are known as 
Bhaskara’s formulae. 
ii. The formulas of item (b) are also known as Viéte’s formulae. 
iii. In the notations of item (a), if A = 0 we say that ax? + bx +c = 0 has two 
equal roots. 


The coming example shows how one can reduce a seemingly complicated 
equation to a simpler one by means of a suitable substitution of variable. 


156 
x2+x° 


Example 2.18 (Brazil) Find all real numbers x such that rP+x4+1= 


Solution By performing the substitution y = x +x, we get the equation y+ 1 = ue 


or, which is the same, y? + y — 156 = 0. For this last equation, since A = 17 — 
4(—156) = 625 = 25’, it follows that y = $= = —13 or 12. Thus, we have 


reduced the original equation to the second degree equations x* + x = —13 and 
x? +x = 12. For the first one, we have A = —51 < 0, so that there are no real 
roots. For the second, A = 49 and, hence, x = = =a = —4 or 3. oO 


Our next example shows that it is sometimes more useful to algebraically 
manipulate a second degree equation than to solve it explicitly. 


Example 2.19 Find the numerical value of (3+ /2)°+(3—/2)° without expanding 
the powers involved. 


Proof Letting u = 3+ /2 and v = 3— V2, we have u + v = 6 and uv = 7, so 
that uv and v are the roots of the equation x* — 6x + 7 = 0. Therefore, making x = u 
and x = v in this equation gives us u? — 6u + 7 = 0 and v? — 60 +7 = O, or, 
equivalently, vu? = 6u — 7 and v? = 6v —7. Multiplying the first equality by wv“ and 
the second one by v*, where k > 0 is an integer, we get 


ykt2 = burt} me Tuk and ykt2 = 6ykt! = Tuk: 
adding both of these, we finally arrive at 


ykt2 Zhe ykt2 = 6(uk! fie oer) — Tu" a v), 
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Writing the previous relation for k respectively equal to 0, 1, 2 and 3, we 
successively get 


w+? = 6(u+v)—7-2=6-6—14= 22; 
w+? = 6 + v*2)—7ut+v) = 6-22—7:6= 90; 
ui+v* = 6(u + v*)— 7 + v*) = 6-90 —7- 22 = 386; 
u+v> = 6(ut + v*)— 7? + v*) = 6- 386— 7-90 = 1686. 
oO 


For the next example, recall that if the sum and the product of two real numbers 
are positive, then both numbers are also positive. 


Example 2.20 Let p and q be given real numbers. If the equation x* + px + q = 0 
has real, positive and distinct real roots, show that the same is true for the equation 
qx” + (p— 2q)x + (1p) = 0. 


Proof Note initially that g 4 0, for otherwise the first equation would reduce to 
x’ + px = 0, which has 0 as one of its roots, thus contradicting our hypotheses. 

Now, let A and A’ be, respectively, the discriminants of x? + px + q = 0 and 
qx? + (p — 2q)x + (1 — p). Let us first show that A’ > 0, which will guarantee that 
the second equation has distinct real roots. Since x7 + px + q = 0 has distinct real 
roots, we have A = p” — 4q > 0. Therefore, 


A’ = (p— 29)’ — 4q(1 — p) 
= p?—4q+4¢? 
=A-+4q’ >0. 


Finally, according to the paragraph that immediately precedes this example, in 
order to show that the roots of gx” + (p —2q)x + (1 —p) = Oare positive, it suffices 
to show that the sum S’ and the product P’ of them are both positive. In order to do 
this, we recall that the roots of x” + px + q = 0 are known to be positive, so that (by 
Viéte’s formulae) —p > 0 and q > 0. Hence, again by Viéte’s formulae, we have 
a ee eo and pote ata 

q q q q q 


as we wished to show. oO 


We finish our discussion of second degree equations with the following important 
remark: if a 4 0 and ax*+bx+c = 0 has real roots a and £ (not necessarily a 4 ), 
then we have the factorisation 


ax’ + bx +c = a(x—a)(x— 8). (2.16) 
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In fact, it follows from item (b) of Proposition 2.16 that, for every real x, 


a(x — a)(x — B) = al’ — (w+ B)x + of] 


It is instructive to compare (2.16) to (2.2). The right hand side of (2.16) is called the 
factorised form of the second degree trinomial ax” + bx + c. 

In turning to more general polynomial equations, it is natural to try to look at 
those of degrees n = 3 andn = 4. In these cases, formulas have been built, in 
terms of the coefficients of the equations, to compute their real roots, if any.® As 
professor I. Stewart teaches us in Chap. 4 of his very interesting book [25], such 
formulas derive from the works of the Italian mathematicians Scipione del Ferro, 
Girolamo Cardano, Niccolo Fontana (conhecido como Tartaglia) and Lodovico 
Ferrari. However, they are too complicated to be useful, and for this reason we 
shall not discuss them here. In order to help convincing the reader, let us just 
mention that Cardano’s formula for the roots of the third degree polynomial equation 
ax> + bx* + cx +d = Ois the following’: 


Vat ve + ¢-P) + ¥a- Ve + Py +2, 


where p = -#, g=prt+ be— Sad andr = 3 (however, see Problems 16, 21 and 22). 


For polynomial Eq. (2.12) of degree n > 5, the Norwegian mathematician Niels 
H. Abel!® and the French mathematician Evariste Galois,!! both of the XIX century, 
proved that there exists no similar formula, built on the coefficients of the given 
polynomial equation, that gives its real (or even complex!) roots. Well understood, 
it doesn’t matter how smart someone is; they proved that it is impossible to find such 
a formula, simply because it doesn’t exist! For a beautiful and elementary account 
of the ideas involved, we refer the reader to [14]. 

Some particular kinds of polynomial equations of degrees 4 and 6, however, 
are sufficiently simple to deserve some attention, specially because appropriate 


8As it happens, these formulas also give the complex roots of the corresponding equations. 
However, we shall postpone any considerations involving complex numbers to [5]. 


°Cf. http://www.math.vanderbilt.edu/~schectex/courses/cubic. 


10Tn his 27 years of life, Abel made several deep contributions to Mathematics, among which the 
most famous one is perhaps the impossibility of solving general polynomial equations of degree 5. 


‘Tn spite of his premature death, when he was only 21 years old, Galois is considered to be one 
of the greatest mathematicians the world has ever seen. His work on the connection between the 
solvability of polynomial equations of degrees n > 5 and Group Theory constitute the foundations 
of what is known today as Galois’ Theory, a branch of modern Algebra with applications to several 
distinct areas of Mathematics. 
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variable substitutions immediately reduce them to second degree equations. Let us 
first examine biquadratic, i.e., equations of the form 


ax’ + bx? +¢=0, (2.17) 


with a,b,c € Randa # 0. The variable substitution y = x* transforms it into 
the second degree equation ay* + by + c = 0, which we already know how to 
solve in R. Therefore, for each nonnegative root y = a of this last equation, solving 
equation x? = a gives us the pair of real roots x = +./a of the original biquadratic 
equation. Conversely, if x = f is a real root of the given biquadratic equation, it is 
immediate to see that y = B? is a nonnegative root of the second degree equation 
ay’ + by +c = 0. We have, thus, proved the following result. 


Proposition 2.21 Given real numbers a, b and c, with a £ 0, the real roots of the 
biquadratic equation ax* + bx* + c = 0 are the reals of the form £./a, where a is 
a nonnegative root of the second degree equation ay* + by +c = 0. 


In order to actually compute the real roots of a specific biquadratic equation, 
instead of invoking the last proposition, it is usually much better to recall our 
previous discussion, remembering that the variable substitution y = x? does the 
job of reducing it to a second degree equation. 


Example 2.22 Find the real roots of the biquadratic equation x*+ + 5x? —7 = 0. 


Solution The variable substitution y = x” leads us to the second degree equation 
y? + 5y —7 = 0, for which A = 53. Hence, the roots of this last equation are 


y= 2S = . Since => == < 0, the real roots of the original biquadratic equation 
are the solutions of x? = = sets , Le., are the real numbers + 4/ ce Ly Oo 


Given n € N, we point out that we can discuss the problem of finding the roots 
of a polynomial equation of the form 


ax" + bx’ +c=0 (2.18) 


in a way quite similar to that used to study biquadratic equations. We refer the reader 
to Problem 16 for the corresponding details. 

Let us now examine reciprocal polynomial equations of degree 4, i.e., polyno- 
mial equations of degree 4 having the form 


ax + be +o7+bxt+a=0, 


where a, b and c are given real numbers, with a # 0. As Problem 28 shows, the 
name reciprocal applies to a larger class of polynomial oats and comes from 
the fact that x € R \ {0} is a real root of it if and only if 4 also is. 

Initially, note that 0 is not a root of the equation above, for a # 0. Therefore, a 
real number x is a root of it if and only if it is a root of the equation 
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5 b a 
ax-+bx+c+—-+—=0, (2.19) 
x x 


which is obtained from the original equation by dividing both sides of the equality 
by x”. Rewrite the left hand side of the last equation above as 


>, 1 1 
a(x + >]+b[(x+-]+c=0. 
x x 


Now, the idea is to perform the variable substitution y = x-+ 1. In order to implement 
it, let us first of all note that, according to (2.1), one has 


2 1)? 2 | 
y=l(xt—)] =x*4+24+5. 
x x 


Hence, x? + + = y’—2, so that solving (2.19) amounts to solving the second degree 
equation 


aly —2) + by+c=0. (2.20) 


However, the above discussion hides a subtlety: it is sure that every real root 
x = a of (2.19) generates the real root B = a + + of aly? —2) + by+c=0. 
Nevertheless, the converse statement is not true: not every real root y = 6 of this 
last equation does generate real roots x = a of the initial reciprocal equation. In 
fact, once we get a real root y = B of a(y* — 2) + by +c = 0, in order to obtain the 
possible real roots of the reciprocal equation corresponding to B, we have to solve 
in R the equation 


1 
x+—=8, 
Xx 


or, which is the same, x” — Bx + 1 = 0. Since the discriminant of this last equation 
is 


A= fp =4, 


it will have real root only if 6? — 4 > 0, ie., only if |B| > 2. 

As was the case with biquadratic equations, in order to actually find the real roots 
of a given reciprocal equation of degree 4, instead of memorizing the result of the 
variable substitution y = x + 1, it is more profitable to follow the steps that led us 


from (2.19) to (2.20). Let us see an example in this respect. 


Example 2.23 Find all real roots of the reciprocal equation 


2xt + 503 + 6x7 + 5x4+2=0. 
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Solution Dividing both sides by x” and grouping summands, we obtain 


2(?+ 5) +5(x+ 2) +6=0 
x2 x ; 


Making the variable substitution y = x + +, it follows that y? = x? + 4 + 2, so that 
the given equation is equivalent to 


20? —2) + 5y+6=0. 


Since this second degree equation has real roots y = —2 and y = -3, it follows that 
the real roots of the original equation are the real roots of the equations x + + =-—2 
and x + i a -}. The first of these equations is equivalent to x? + 2x + 1 = 0 and, 
hence, has two real roots, both equal to —1. The second is equivalent to 2x7+.x+2 = 
0, which has discriminant A = —15 < 0; therefore, it has no real roots. oO 


Problems: Section 2.3 


1. Let b and c be given real numbers, such that the equation x” + b|x| + ¢ = 0 has 
real roots. Prove that the sum of these roots is always equal to 0. 
2. Solve, forx € R, the following equations: 


(a) x+ V¥x+2= 10. 
(b) /x+ 10— J/2x+3 = V1 — 3x. 
(c) x2 + 18x + 30 = 27x? + 18x + 45. 


3. (IMO) In each of the cases (a) A = /2, (b) A = 1 and (c) A = 2, findallx €e R 


for which we have 
Vxtv2x—-—14+ yx-vV2x-1=A. 


4. A math teacher composed three different quizzes. In the first one, he put 
a second degree equation. In the second, he put almost the same equation, 
changing just the coefficient of the monomial of second degree. Finally, in the 
third quiz, once more he put almost the same equation of the first, this time 
changing just the constant coefficient. It is known that the roots of the equation 
of the second quiz are 2 and 3, and that those of the third one are 2 and —7. 
Decide whether the second degree equation of the first quiz has real roots and, 
if this is so, compute them. 

5. Let a and b be two distinct, nonzero real numbers. If a and b are the roots of the 
equation x* + ax + b = 0, find all possible values of a — b. 
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LI: 


12. 


13. 
14. 


15. 


16. 


17. 
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. Let @ and B be the roots of x” — 13x + 9 = 0, anda and b be real numbers such 
that the equation x? + ax + b = 0 has roots a? and £7. Compute the value of 
a+b. 

. Equation x? -+x—1 = 0 has roots wu and v. Find a second degree equation whose 
roots are vu? and v?. 

. The roots of the equation x” — Sx + P = 0 are the real numbers w and f. Find 
a second degree trinomial whose coefficients are expressions built on S and P 
and whose roots are the real numbers aS + P and BS + P. 

. Use the theory of second degree equations developed in this section to compute 
the value of the sum (7 + 4,/3)5 +(7- Ax/3)*, 

. If @ is a root of x? — x — 1 = 0, find all possible values of a* — 5a. 

For which integer values of m does the equation x* + mx + 5 = 0 have integer 

roots? 

Show that, for every a,b,c € R, with a # 0, the equation 


+ 
x-b x-c a 


has exactly two distinct real roots. 
Solve equation x = ,/x— + +,/1- + in the set of real numbers. 
Show that, for every real number a ¥ 0, the equations 


ax? — x? —x—(a+1) =0 and ax*—x-—(a+1)=0 


have a common root. 
(Soviet Union) Do the following items: 


(a) For real x, write the number x? — 3x? + 5x in the form 
a(x — 1)? + bv — 1)? +c(Qx— 1) +d, 


with a,b,c,d € Z. 
(b) Ifx andy are real numbers such that x7—3x"+5x = 1 and y3—3y*+5y = 5, 
compute all possible values of x + y. 


* Letn € N anda,b,c € R, with a # 0. Elaborate, for the equation axe" + 
bx” + c = 0, a discussion analogous to that made on the text for biquadratic 
equations, and which led us to Proposition 2.21. 

* Consider the polynomial equation of third degree x* + ax? + bx +c = 0, 
where a, b and c are given real numbers, with c # 0. If a is a (nonzero) real 
root of it, prove that there exist real numbers b’ and c’ such that we have the 
factorisation 


Pe +arP+bxte= (x—al(? +b'x4 c). 


2.3 


18. 


19. 


20. 


21. 


22. 
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Then, conclude that the original polynomial equation has at most three (not 
necessarily distinct) real roots. Moreover, if this is the case, and a, 6 and y are 
its three real roots, show that 


x + ax +bxt+ec=(x—a)(x— B)(x-y). 


This result generalizes the factorised form (2.16) of a second degree trinomial 
and is a particular case of the division algorithm.'”. 


(a) Show that the real number a = V2 + 4/5 + V 2—J/5 is a root of the 
equation x? + 3x-4=0. 

(b) Conclude that @ is a rational number. 

* Establish the following version of Girard’s relations!* between roots and 

coefficients of a polynomial equation of third degree: if the real numbers ao, a, 

az and a3 (a3 # 0) are such that the equation 


ax + ax + ayx+ay =0 


has (not necessarily distinct) real roots x;, x. and x3, then 
x+y +33= —2 
XX2 + X1X3 + X2X3 = = . (2.21) 


XX2X3 = 


Assume that the equation x* — 3x + 1 = 0 has three real roots, say a, B and y. 
Compute the values of a” + B? + y?,a? + B? + y? anda* + B44 y?*. 

Still concerning the third degree polynomial equation x° + ax? + bx +c = 0, 
with a,b,c € R, prove that there exists a real number d such that the variable 
substitution y = x — d transforms the given equation into one of the form 
y> + py + q = 0, for certain real numbers p and q. 

Concerning the equation x* — 11x + 16 = 0, do the following items: 


(a) Substitute x = u+v and get an equivalent equation in the two real variables 
wand v. 

(b) Impose that uv = 4 (i.e., make uv equals -t times the coefficient of x in 
the original equation) and conclude that the equation in u and v of item (a) 
is equivalent to u& + 16u3 + (24)? = 0. 

(c) Find uw and v, and conclude that one of the roots of the given equation is 


: VII91 JT191 
84 + 38-— 


For more details concerning this point, as well as for the generalization of the result of 


Problem 19, we refer the reader to [5]. 
'3 After Albert Girard, French mathematician of the XVII century. 
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24. 


25. 
26. 
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28. 


29. 
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The items above describe, by means of a specific example, the ideas behind 
Cardano’s formula for the roots of a polynomial equation of third degree. 
* Let x be a nonzero real number, such that (x + 1 = 3. Compute all possible 
values of x? + 4. 

1 


If x is a nonzero real number such that x + 1 = 4 calculate x* + A 


If x° — x— 1 = 0, compute all possible values of (x — A) + (x - a. 
(Singapore) If x* — 4x + 1 = 0, compute all possible values of 


Solve the reciprocal equation x* — 7x? + 14x? —7x +1 = 0. 
Given real numbers ao, a1,...,@n, With a, 4 0, the polynomial equation 


Ap" + An—x” | +++» + ax + ao = 0, 


is said to be reciprocal if a, = d,—,, forO < k < n. If @ is a real root of 
such an equation, show that a # 0 and that + is also a root of it. (Hence, the 
name reciprocal justifies itself by the fact that reciprocal polynomial equations 
have real roots which are pairwise reciprocal.'* In this case, one also says that 
nx” + dn—x | +--+ + ax tag isa reciprocal polynomial.). 

Do the following items: 


(a) * Given a real number x ¥ 0, write x? + + in terms of y= x+ 1. 
(b) Use the result of (a) to reduce the reciprocal equation ax® + bx + ext + 
dx? + cx’ + bx +a = 0, of degree 6, to a polynomial equation of degree 3. 


(Brazil—adapted) We are given nonzero integers a > b, such that the quadratic 
equation x” + ax + b = 0 has nonzero integer roots c > d. Then we form the 
quadratic equation x” +cx+d = 0 and check if it also has nonzero integer roots. 
If this is so, we let e > f be those roots and form the quadratic equation x? + 
ex+f = 0. We proceed in a likewise manner until we reach a quadratic equation 
with nonzero integer coefficients but with no integer roots. The purpose of this 
problem is to find all a and b for which this process continues indefinitely. To 
this end, do the following items: 


(a) Show that if the process is to continue indefinitely, then we can assume that 
a > 0 > band that every subsequent equation x? + ax + B = 0 is also 
such thata > 0> B. 

(b) Under the choices of (a), let x? + mx + n = 0 and x* + px + q = 0 be two 
consecutive equations (i.e., such that p > 0 > gq are the roots of the first), 


'4We shall see in [5] that the same holds for the complex roots of this equation. 
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and let A and A’ be their discriminants, respectively. Show that A’ < A, 
unless n = —lorm=1,n=-—2. 
(c) Conclude that a = | and b = —2 is the only possible choice to begin with. 


2.4 Linear Systems and Elimination 


Let E and F be algebraic expressions in the real variables x,,...,x,. By the 
equation E = F in the real variables or unknowns x), ..., X,, We mean the problem 
of finding all sequences!» (a,,...,@,) of real numbers, such that E and F make sense 
and the equality E = F holds when x; = qj, ..., X, = dy. In this case, each such 
sequence (d),...,@,) is said to be a solution of the equation EF = F. 

Now, let Fj, ..., Em, Fi, ..., Fim be algebraic expressions in the real variables 
X1,...,Xy. The system of equations 


Ei =F, 


Ey = Fy 
(2.22) 


En = Fn 


is the problem of finding all sequences (a,,...,a,) of real numbers, such that the 
substitutions x; = dj, ..., X, = dy solve all of the equations E; = Fj. As above, 
each such sequence (a ,...,d,) is said to be a solution of the system (2.22). To 
solve a system of equations as (2.22) means to find all of its solutions. 

In this section and the next one, we shall learn how to solve some simple (though 
useful) systems of equations. We are also going to see that, under certain conditions, 
an equation E = F (in several real variables) is equivalent to a system of equations 
like (2.22), and this will be a source of a number of interesting examples. 

The simplest—and, for our purposes, also the most useful—systems of equations 
are the linear systems with two (resp. three) equations in two (resp. three) real 
unknowns, i.e., systems of equations of one of the forms 


axt+ by + cz = d 
or 4 a@xt+hy+toz=d , (2.23) 
a3x + b3y + 03Z = d3 


axt+tby=c 
ax + boy = C2 


respectively. Here, the real numbers q;, b;,c;,d; are given and not all zero, being 
called the coefficients of the linear system. 


1S Although the reader is probably acquainted with the concept of sequence, we refer to Sect. 6.1 
for a rigorous definition. 
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The most efficient method for solving linear systems like those above is the 
elimination algorithm,'® also known as gaussian elimination,'’ which is based 
in the following result. 


Lemma 2.24 Let E and F be given algebraic expressions in the real variables x 
and y (or x, y and z). For reals a, b and c, the systems of equations 


E=a d E+cF=a+cb 
F= F=b 


have the same solutions. 


Proof We shall prove the lemma in the case in which E and F are algebraic 
expressions in the real variables x and y; the other case is completely analogous. 
Suppose that x = xo and y = yo solve the system on the left, so that, when we 
substitute x by x9 and y by yo into E and F, both equalities E = a and F = b hold; 
we shall denote such a situation by writing E(xo, yo) = a and F(x, yo) = b. Then, 
substituting x by x9 and y by yo into E + cF gives us 


(E + cF)(X0, yo) = E(xo, yo) + cF (x0, Yo) = a + cb, 


so that x = x9 and y = yo also solve the system on the right. Conversely, if x = x9 
and y = yo solve the system on the right, then, since E = (E+cF)—cF, an argument 
entirely analogous to the above shows that x = x9 and y = yo do solve the system 
on the left. Therefore, both systems have the same solutions. Oo 


Back to the analysis of the linear systems (2.23), let us show that a number of 
careful applications of the previous lemma lead us to quick solutions of them (the 
elimination algorithm consists exactly of this). 

We start by the system on the left, which, for simplicity, we write as 


ax+ by =e 
ext dy=f ~ 


Since at least one of the coefficients a, b, c,d is nonzero, we can suppose, without 
loss of generality, that a # 0 (otherwise, it suffices to rewrite the system in one of 
the forms 


‘An algorithm is a finite sequence of precise procedures that, once followed, give an expected 
result, also known as the output of the algorithm. Although algorithms will play a very modest 
role in this volume, [5] brings a number of very interesting ones. 


'7 After Joanne Carl Friedrich Gauss, German mathematician of the XVIII and XIX centuries. 
Gauss is generally considered to be the greatest mathematician of all times. In the several different 
areas of Mathematics and Physics in which he worked, like Number Theory, Analysis, Differential 
Geometry and Electromagnetism, there are always very important and deep results or methods that 
bear his name. We refer the reader to [26] for an interesting biography of Gauss. 


2.4 Linear Systems and Elimination 47 


cx+dy=f by +ax=e - dy+cx=f 
ax+by=e’ |dy+cx=f by+ax=e” 


according to whether c, b or d is nonzero, and change a by this number in the 
following discussion). 


Then, let a 4 0 and E = ax + by, F = cx + dy. Changing the equation F = f 


by the equation 


Pa Bap sce, 
a a 


we get the system 


E =e 
P= "Eh=f> =e 


> 


and Lemma 2.24 immediately assures that the solutions of this new system coincide 
with those of the original one. Hence, in order to solve that system, it suffices to 
solve this last one. On the other hand, since 


b 
F—“E = (ex + dy) — (ax + by) = (« = ~) y, (2.24) 
a a a 


the last system reduces to 


ax + by =e 
dy=f'’ 


where d’ = d— “ and f’ = f — <e. Now, we shall consider three different cases: 


a 
If d’' ¥ 0 (or, equivalently, ad — bc # 0), then the second equation above gives 
y= L, and the substitution of this value into the first equation gives x = 4(e = 


by) = + (c - of). In this case, the system is said to be determined, for it has a 


unique solution. 

If d' = 0 (or, equivalently, ad — bc = 0) and f’ # 0, the system is impossible, 
for the second equation reduces to Oy = f’, which has no roots at all. 

If d’ = f’ = 0, then the second equation reduces to Oy = 0, an equality which is 
true for all real values of y. Therefore, the system as a whole consists only of the 
equation ax + by = 0, which has infinitely many solutions (making y = a, with 
a € R, we getx = —*4), For this reason, the system is said to be undetermined. 


As was the case in the previous chapter, we would like to stress that, in 


specific examples, rather than memorizing the formulas obtained through the above 
discussion, one should just execute the elimination process. The previous discussion 
(more precisely, (2.24)) makes it clear that this process consists of subtracting an 
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appropriate multiple of the first equation from the second, in order to eliminate the 
variable x from it (and that’s where the name elimination comes from). 


Example 2.25, Use Gaussian elimination to find all real values of a for which the 
system equations 


is impossible, undetermined or determined. 
Solution Subtracting —% times the first equation from the second, we get the 


2 
equivalent system 


2x + ay =3 
Ho $)ya30-0: 


If 2- a # 0, which is the same as a # +2, then the second equation above 
— 3(1=a) 


gives us y = 7, so that the first equation furnishes a single value for x, namely, 
1 3(4-—a) 
= -(33—ay)= 
a eC er 


Therefore, the system is determined whenever a # +2. 
If a = +2, then the second equation reduces to Oy = (1 + 2), which represents 
an impossible equality. Therefore, the system is impossible. Oo 


For a geometric interpretation of Gaussian elimination for linear systems of two 
equations in two real variables, see the problems of Sect. 6.2 of [4]. 

Let us now turn our attention to the linear system on the right, in (2.23). In order 
to analyse it, let EF; = ajx + by + cz, for 1 < i < 3, so that it reduces to 


FE, =d 
Ex, =d>. 
E3 = d3 


As was done for linear systems in two variables, we can suppose that a, # 0 (the 
other cases being totally analogous). Changing equations E, = d) and £3 = d3 
respectively by 


(in order to eliminate the variable x from the second and third equations), we get the 
equivalent system 
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Ey =d\ 
Ey - Et =d)—- od) 
E3— 2E, = d3— Sd 


Since 


= bby + cz, 


and, analogously, E3 — GE, = bhy + c4z, it suffices to solve the system 


axtby+az=d 
bhy+oz=d,, 
bhy + c4z = dy 


where d, = d) — od and di = d3 — adi. 
In case all of the numbers 5), c), b,c’, are equal to 0, the last system above 
reduces to 


ax + by + cz = d) 
Vy a 
C=d; 


If d, # 0 ord, ¥ 0, the system is impossible; if d, = d, = 0, the system is 
undetermined, for it is equivalent to the single equation a;x + bjy + cz = 0, which 
obviously has infinitely many solutions. 

Suppose, then, that at least one of the numbers bj, c5, b4, or c, is nonzero, say 
b’, # 0 (as before, the other cases can be treated quite analogously). Then, applying 
Gaussian elimination to the system 


by +z = d, 
by + c4z = dy, ’ 
we obtain a system of the form 
axtby+az=d 


boy + cz = dy , 
a 11 
C32 = d, 
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which is also equivalent to the original system. The discussion, then, goes on as 
before, and we concentrate our attention in the third equation, C4z = dy . Also as 
before, we have to distinguish three distinct cases: 


* Ifc; # 0, then the third equation above gives a single value for z, say z = y; 
since b, # 0, the substitution of this value into the second equation furnishes 
y= x (ds -—cz) = x (a) — chy). Finally, since a; # 0, the substitution of the 
values thus obtained for y and z into the first equation give us a single value for 
x. Then, we conclude that the system is determined. 

* Ifc{ =O and dj # 0, the system is impossible, for the third equation reduces to 
0z = dj. 

* Ifc{ = dj = 0, then the third equation reduces to the equality 0z = 0, and the 
system as a whole reduces to 


axt+tby+eaz=d 
byt chz= dy 


or, which is the same, 


ey —C1Z 
by = dy cz | 


Since a; and b‘ are both nonzero, for each real value z = y the system 
corresponding to 


os di —c1y 
By) y= GY 


is determined; therefore, the original system in x, y, z is undetermined. 


We can summarize the above discussion by saying that the elimination algorithm 
for linear systems of the form 


ax+tby+az=d 
ax + boy + c2z = dz 
a3x + b3y + 03z = d3 


consists in performing, one by one, the following three steps: 


1st. Eliminate the variable x from the second and third equations, by adding to 
these equations appropriate multiples of the first one, thus obtaining a system 
of the form 


axtby+az=d 
byy+c4z = dy . 
by + c4z = ds 
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2nd. Eliminate the variable y (in case b, # 0 in the last system above) from the 
third equation, by adding to it an adequate multiple of the second equation, 
thus obtaining a system of the form 


axtbyt+az=d 
bhy+oz=d;, . 
c3z = dy 


3rd. Analyse equation c{z = d¥ and, after this, the other two equations in 
succession, in order to decide whether the original system is determined, 
undetermined or impossible. 


As before, it is much more useful to keep the general steps above in mind than 
to try to memorize any of the expressions obtained through the calculations in the 
previous discussions. Let us see one more example to illustrate this point. 


Example 2.26 Find all real values of a for which the system of equations 


x+y-az=-l 
2x+tay+z=1 
ax+y-—z=2 


is impossible. 


Solution Multiplying the first equation respectively by 2 and by a, and subtracting 
(also respectively) the results from the second and third equations, we get the 
equivalent system 


x+y-—az =-1 
(a—2)y+ (1+ 2a)z = 3 : 
(l—ay—(l—-a’)z =2+a4 


If a = 1, the last equation reduces to the impossible equality 0 = 3, and the 
original system is impossible. If a # 1, add to the second equation = times the 
third one, thus obtaining the equivalent system 


x+y-az =-1 
(a* +a—- 1)z — ra—3at7 | 
(—a)y—-(l-@)z=2+a 


(Observe that we have slightly changed the second step, so that the roles of the 
second and third equations of the last system are interchanged—it is the second 
equation that, now, has just one variable. Obviously, this is perfectly right, and shows 
that the elimination algorithm is quite a flexible one.) 
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Now, if a7 +a—1 = 0,ie., ifa = ae then the system is impossible, 
for —a* — 3a + 7 # 0 for these values of a and, hence, the second equation 
14/5 


reduces to an impossible equality. If a # —>*-, then the second equation gives 
us a single possible value for z; in turn, upon substitution of this value of z into the 
third equation, we find a single possible value for y; finally, putting these values for 
y and z into the first equation, we find a single possible value for x, so that the system 
is determined. 

We conclude a = 1 anda = =iaws are the values of a for which the original 
system is impossible. Oo 


We finish our discussion of linear systems by observing that Gaussian elimination 
algorithm can easily be put to work to the analysis of the general linear system in m 
equations and n unknowns 


Qy1X1 + Ay2X2 +++ + AnXn = dy 


A21X1 + A22X2 +++ + ArnXy = bp 
. 3 (2.25) 


Ami X1 + GnaX2 +++ + AmnXn = Bin 


here, the a and b; are given real numbers, such that at least one of the aj is nonzero. 

Apart from an important particular case of general linear system, which will 
make its appearance in Sect. 18.1 of [5] (and will be analysed there, by other 
methods), we shall not make a systematic use of such systems along these notes; 
hence, we shall not develop the general analysis of the application of the elimination 
algorithm to them. If it is the case we have to solve such a linear system (as in the 
example below), some clever reasoning, perhaps with the aid of Lemma 2.24, will 
suffice. 


Example 2.27 (OCM) Find all real solutions x, x2, ..., X109 of the linear system 
Xj + X2 + X3 =0 


Xo + x3 + x4 = 0 


X9g + X99 + X100 = 0 
X99 + X1900 + x1 = 0 
X100 +X} +22 =O 


Solution Observe that each x; appears in exactly three of the given equations. 
Therefore, adding all of them and dividing both sides by 3, we get 


Xy + x2 +23 +--+ + X100 = 0. 
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In order to find x; = 0, just note that 


O = x1 + (X2 + x3 + x4) + +++ + (Xog + X09 + 100) 
=x,+04+04+---+0 = x. 
For x. = 0, write 
O = x2 + (43 + X4 + X5) +--+ + (X99 + X100 + X1) 
=%2+0+04+---+0 = x. 


Now, x1 + x2 + x3 = 0 implies x3 = 0. Then, x2 + x3 + x4 = 0 implies x4 = 0, and 
so on, so that all of the x;’s are equal to 0. oO 


We refer the interested reader to Chap. 1 of [18] for quite a detailed exposition 
of the elimination algorithm for general linear systems. 
Problems: Section 2.4 


ax + by=e 
cxt+dy=f’ 


_ 


. Assume that, in the linear system we have a, b, c, d, e, f # 0. 


Prove that: 


(a) $F & <> the system is determined. 
(b) ¢ =4=£ the system is undetermined. 
(c) $= 5 x 5 <> the system is impossible. 


2. Find all real values of a for which the system of equations 
x+2y—3z=4 


3x—-y+5z=2 
4xty+(a@-14)z=at+2 


is impossible. 
3. Solve the system of equations 


| o+ 


+ + 


NIWA TA Le 
ll 
| 
= 


KINI 1 
SINS INS Jeo 


4. For 1 < i,j < 3, let aj be given real numbers, such that a?, + a?, + a}, = 1 for 
1 <i <3 and aaj + aaj + agaj = 0, for 1 < i,j < 3 withi ¥ j. Given real 
numbers b,, bz, b3, solve the linear system 
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1X1 + ay2X2 + 443x3 = Dy 
2X1 + d22X2 + a73x3 = bo , 
431X, + A32X2 + 433X3 = b3 


writing x, x2 and x3 in terms of the aj and b;. 
5. With respect to the linear system (2.25), do the following items: 


(a) Ifb; = bp =--- = by, = 0, then the system always has at least one solution. 

(b) If the system has at least two different solutions, then it has infinitely many 
solutions. 

(c) If the system has only one solution when bj = bz = --- = b,, = 0, then, for 
general values of b,, bo, ..., bj, it has at most one solution. 


2.5 Miscellaneous 


Let us now turn our attention to the next simplest type of system of equations, 
namely, second degree systems. Ultimately, such a system is just a rephrasing of 
a second degree equation in terms of a system of two equations in two unknowns. 
Nevertheless, the reader will be amazed on how they provide a number of interesting 
applications. 


Proposition 2.28 Let S and P be given real numbers. The system of equations 


aaa aed (2.26) 
xy=P 

has real solutions if and only if S* > 4P. Moreover, in this case, the solutions are 

given by x = a, y = B or vice-versa, where a and B are the roots of the second 

degree equation u* — Su+ P = 0. 


Proof First of all, if aw and are the roots of u — Su + P = 0, then we know from 
Viéte’s formulas (see item (b) of Proposition 2.16) thata + 6 = S andap = P. 
Therefore, the pairs x = a, y = B andx = f, y = a do satisfy the system of 
Eq. (2.26). 

Conversely, let x = xo, y = yo be any solution of that system. Then, the first 
equation gives yp = S — Xo, and the second equation then gives P = xpyo = xo(S — 
xo), or, which is the same, x3 -—Sxo +P = 0. Therefore, xo is a root of u7—Su+P = 0, 
from where we get x» = a or xo = f. Sincea+f = S, we have two possibilities: 


¢ Ifxo =a, then yy = S—x =S—a= Bp. 
° Ifxo = B, then yy =S—x =S—fB =a. 
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The following examples show how we can sometimes reduce the search for 
solutions of more complex systems of equations to that of simpler systems, of one 
of the types we met so far. 


Example 2.29 Find all real solutions of the system of equations 


(x? + 1)G7 + 1) = 10 
(x + y)(xy — 1) =3 


Solution First of all, rewriting the left hand side of the first equation as 
4)674)1) =e%r 42441 
= (ay)? + [+ 9)? — 2ay] + 1, 
and letting x + y = a and xy = b, we get the system 


b?+a*—2b=9 
a(b—1) =3 


Now, writing 
b? +a*—2b =a’ + (b —2b+1)-1 
=a+(b-1P-1 
and making the substitution b — 1 = c, we reach the system 


av+c=10 
ac = 3 , 


By squaring the second equation, we transform this last system into one of the 
form (2.26), whose unknowns are a? and c?, and such that S = 10 and P = 9. Since 
the roots of the second degree equation u? — 10u-+ 9 = O are | and 9, it follows that 
a2 = 1 or 9 and, hence, a = +1 or +3. Then, we have the possibilities 


(a,c) = (1,3), (-1, —3), 3, 1) or (—3,-1), 
from where 
(a,b) = (1,4), (-1, —2), (3,2) or (—3,0). 
Finally, each of these pairs (a, b) give us another system of the form (2.26), with 


unknowns x and y. Solving the four systems thus obtained, we arrive at the solutions 
of the original system: 
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(x,y) = (1, —2), (—2, 1), 1, 2), (2, 1), (0, —3) or (—3, 0). 


oO 


Some seemingly complicated equations can be easily solved, provided we find a 
way to transform them into systems of equations. Since there is no general procedure 
that tells us when or how this can be done, each equation should be analysed 
separately. In this respect, the following example shows that a frequently useful 
algebraic trick is the introduction of new variables. 


Example 2.30 (Israel) Find all real solutions of the equation 
V1I3+x+ V4—x = 3. 


Solution Letting a = 13+ x and b = 1/4 —x, we geta+ b = 3 anda*+b* = 
(13 +x) + (4—x) = 17. Hence, we have reduced the problem of solving the giving 
equation to that of solving the system of equations 


a+b=3 
a’+b4=17" 
Applying formula (2.1) for the square of a sum twice, we get 
Waat+h = @ +b’) -20°b° 

= [(a + b)? — 2ab]? — 2(ab)* 

= (9—2ab)* — 2(ab)” 

= 81 — 36ab + 2(ab)’, 

so that 


(ab) — 18(ab) + 32 = 0. 


Solving for ab the second degree equation above, we find ab = 2 or ab = 16. 
Therefore, there are two distinct possibilities: 


,fatb=3 fatb=3 
ee ne eee 


Possibility (i) gives a = 1 and b = 2, or vice-versa. If a = 1, then 13 + x = 1 
and, hence, x = —12. Ifa = 2, then 13 + x = 16 and, hence, x = 3. Possibility (ii) 
does not give any real solutions, for, in this case, a and b would be real roots of the 
second degree equation u* — 3u + 16 = 0, which has none of them. oO 
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The following lemma states a relatively easy sufficient condition to transform the 
search for the roots of an equation in one unknown into that of a system of equations. 


Lemma 2.31 /f E,, E2,...,E, are expressions involving one or more real vari- 
ables, then 
E, =0 
2 2 2 fy = 
E, +E +++ =06 ; (2.27) 
E, =0 


Proof This is an easy generalization of Corollary 1.5 and of Problem 2, page 10. 
Oo 


Example 2.32 Find all real roots of the equation 
ty +4+y42=2y+ Qxy, 
Solution We can write the given equation as 
(xy? — 2x7y + 1) + 6 —2y+ 1) =0, 
or, which is the same, 
(’y-— 1)? + (¥-1) =0. 


Therefore, by the previous lemma, the equation is equivalent to the system of 
equations 


xry-1=0 
y-1 =0’ 
whose solutions y = 1, x = +1 can be obtained without difficulty. Oo 


Problems: Section 2.5 


1. (IMO) Find all real numbers x, y and z, such that the sum of any of them with 
the product of the other two is always equal to 2. 

2. Let a be a nonzero real constant. Find, in terms of a, all real numbers x and y 
that solve the system of equations 


wtx=atl 


2 =a 
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3. Solve, for x, y € R, the system of equations 


x+y Y= 5 
xy xy 
x+y xy” 6 


4. (IMO—adapted) Consider the system of equations 


axt + bx) +0 = x2 
axs + bx» +0= %3 
ax, + bx3 +0= x) 


whose unknowns are x1, x2,x3, where a, b and c are given real numbers, with 
a# 0.If A = (b—1)* — 4ac, do the following items: 


(a) If A < 0, then there is no real solution. 
(b) If A = 0, then there is exactly one real solution. 


5. (TT) Find all real solutions of the system of equations 


e=2y-1 
y=2z-1. 
2 =2x-1 


6. Solve, for x, y € R, the equation x? + 2xy + 3y? + 2x+ 6y+3=0. 
7. (IMO) Find all real values of a for which the system of equations 


x+y =4z 
3x+4y+z=a’ 


with unknowns x, y and z, has a single solution. 
8. (NMC) Find all real numbers x, y, z greater than 1, such that 


3 3 3 
Lerten tae rie Chuan Vy+2+Vve¥2). 


9. Find all real roots of the equation /x + 5+ /11—x = 6. 
10. Find all real roots of the equation ¥5— /5—x =x. 
x2 


11. (Canada) Find all real roots of the equation x* + Gy = 3. 
12. (Canada) Solve the system of equations 


2.5, 


13. 


14. 


15. 


16. 


17; 
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(Romania) Let a, b, c and d be given real numbers, such that 

a+b+c<3d 

b+c+d<3a 

c+td+a<3b- 

d+a+t+b<3c 
Prove thata =b=c=d. 
* Let |, £o,...,£, and F), F>,..., F, be given expressions in one or more real 
variables, such that E} < F, Ey < Fo, ..., E, < Fy. Prove that the equation 


Bp pied By SF oe F, 


is equivalent to the system of equations 


E, =F, 
Ey = Fy 
Ey = Fy 


(Romania) Find all real roots of the equation 


V 4x2 —x4-—34 V4 -yt 4+ V42-7445=6. 


Solve, in the set of positive reals, the system of equations 
4 Sy 
x+ Fr 
4 _ 52 
z+i= = 


(Soviet Union—adapted) 


(a) For x > 0, prove that x + 2 > 2/2, with equality if and only if x = V2. 
(b) Solve the system of equations 


2y=x+4 
wz=yts : 
2x=z+2 


Chapter 3 
Elementary Sequences 


An (infinite) sequence of real numbers is an infinite ordered list (a,,a2,q3,...) 
of real numbers, i.e., an infinite list of real numbers, in which we specify who is 
the first number of the list, who is the second, third and so on. It is customary to 
denote an infinite sequence as above simply by (a;)x>1. A (finite) sequence of real 
numbers is a finite ordered list (a, a2,...,An) of real numbers, i.e., a finite list of 
real numbers, in which, as was the case with infinite sequences, we specify who is 
the first number of the list, who is the second, the third, ..., the n-th. Is it customary 
to denote a finite sequence as above simply by (a;)1<x<n. In any of the cases above, 
we say that a, is the k-th term of the sequence.! 

Our aim in this chapter is to study some types of elementary sequences that 
occur quite frequently in elementary Mathematics. Along this process, we shall 
introduce several definitions and properties that will apply to general sequences as 
well. Whenever there is no danger of confusion, we shall concentrate our discussion 
on infinite sequences, letting to the reader the task of adapting all that comes to finite 
sequences. 


3.1 Progressions 


We say that a sequence (a,)x>1 is defined by a positional formula if the values 
a, € R are given by means of an expression that depends on k. 


‘As we will see in Chap.6, an infinite (resp. finite) sequence of real numbers is, rigorously 
speaking, a function f : N > R (resp. f : {1,2,..., n} — R, for some n € N). Thus, the above 
notation comes from the shortcut of letting a, = f(k), for every k € N (resp., every 1 < k <n). 
However, for the time being, the above definition will be harmless and sufficient for all of our 
purposes. 
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Example 3.1 The sequence (a,),>1 of the perfect squares is the infinite sequence 
(17,27, 3?,...). Therefore, we have a; = 17, ay = 27, a3 = 3? and, more generally, 
a, = k’, for every integer k > 1. 


An alternative to defining a sequence by means of a positional formula is to give 
a recursive definition of it. This means that we should specify a (finite) number 
of initial terms of the sequence, as well as a recipe to calculate the other terms by 
recurring to those that precede it. Let us see an example. 


Example 3.2 Consider the sequence (a;),>1, defined recursively by aj = 2, az = 5 
and 


ag = 2ag—1 — ag—-2, V kK = 3. (3.1) 


If we take k = 3 in the relation above, we get a3 = 2a2 —a, = 2:5—2 = 8; if 
we now take k = 4, we find a4 = 2a3 — a, = 2-8 —5 = 11, and so on. In the 
above, relation (3.1) is the recurrence relation, or simply the recurrence satisfied 
by the sequence (ax)x>1- 


In the previous example, it is important to notice that we were only capable 
to compute the value of a; because we knew, beforehand, both the recurrence 
relation (3.1) as well as the values of a; and az. Knowing just the value of a; 
would not suffice, for (3.1) computes each term as a function of the two ones that 
immediately precede it. On the other hand, if we changed the values of a; and az (but 
maintained the recurrence relation), we would generally change the corresponding 
values of subsequent terms (just make a; = 1 and az = 2 in the given recurrence 
relation, and compute the new value of the a3). 

It is also worth observing that there are other equivalent ways of writing the 
recurrence relation of the former example. For instance, letting k — 2 = j, we get 
k = j +2 and, hence, aj42 = 2aj4; — aj, Vj = 1 (since k > 3 = j = 1). In other 
words, the name we give to the index of a sequence (i.e., j, k etc.) is not relevant to 
its definition; we could equally well write it as ag42 = 2az4+1 — ay, Vk = 1. 

Sometimes, we will also need to list the terms of a sequence starting from zero, 
denoting it as (ax)x>0. At first, this could look a little odd, for the first term of the 
sequence would be ao, the second would be a, etc. Nevertheless, since this can be 
a natural choice in a number of contexts, we urge the reader to get used to it. For 
example, the sequence (1,4, 9,16,...) of the perfect squares could be written as 
ay = (k + 1)’, for all k > 0. 

At this point, a natural question poses itself: if a certain sequence is defined 
recursively, how can we find a positional formula for its terms? Unfortunately, this 
question does not admit a simple answer. Instead of trying to attack this general 
problem right now, we shall first discuss some simple examples, postponing a deeper 
analysis to Chaps. 3 and 21 of [5]. 

We begin by looking at arithmetic progressions. 


Definition 3.3 A sequence (a;,),>1 of real numbers is an arithmetic progression 
(we abbreviate AP) if there exists a real number r such that 
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ati = a+r, (3.2) 


for every k > 1. 


In the previous definition, we refer to the real number r as the common 
difference of the AP. We also observe that, for an AP (ax)x>1 to be completely 
determined, one has to give both its first term a; and its common difference r. For 
example, the sequence (ax),>1 such that a; = 2 and ag41 = ag + 3 fork > 1, is the 
AP with initial term 2 and common difference 3, and it is completely determined by 
the recurrence relation it satisfies and by the value of its first term. However, if we 
only knew that a4, = a, + 3 fork > 1, we would not have just one AP, for we 
would not know how to start it. 


Example 3.4 Let (ax)g>1 be an AP of common difference r. Prove that the 
sequences (by)x>1 and (cx)x>1, defined by by = ar, and cy = drx—1 fork > 1, 
are also AP’s, with common differences equal to 2r. 


Proof We look at the sequence (b;)x>1, the case of the sequence (cx)x>1 being 
totally analogous. By the definition of an AP, it suffices to show that b,4, —b, = 2r, 
fork > 1. Since (az)z>1 is an AP with common difference r, it follows from the 
definition of b, that 
bei — be = Ar(K41) — Are = Ar+2 — Ax 
= (2x42 — Gre+1) + (A241 — Gx) 


=r+r=2r, 


as we wished to prove. oO 


Another useful recursive characterization of AP’s is the one given in the 
following result. 


Proposition 3.5 A sequence (ax)x>1 of real numbers is an AP if and only if 
O42 + ag = Zags, Vk = 1. (3.3) 


Proof By definition, a sequence is an AP if and only if ag —a, = a3 -—a@ =-:--, 
ie., if and only if we have ay42— ag+1 = ax+1 — ay for every integer k > 1. This is 
an equivalent way of writing (3.3). oO 


The next result collects two useful formulas for AP’s. 
Proposition 3.6 If (ax)x>1 is an AP with common difference r, then 


(a) a, = a, + (k— 1)r, for every k > 1. 
(b) ay tag +--+ +a, = Maran) for every n> 1. 


64 3 Elementary Sequences 


Proof 
(a) Diagram 


hr +r +r +r +r 
QQ. DD SEES ee EES aE 


makes it clear that, in order to reach a; from aj, itis necessary k—1 steps, where 
each step consists of adding r to a term. Therefore, in order to obtain a, from 
a1, we have to add (k — 1)r to a1, so that ag = a, + (k—I)r. 

(b) From diagram 


+r “Er F = =F =7 
aq — a — AO a Ee = Hs 


we conclude that 


aq+a,= (a2 —r) + (@p-1 +r) =a. + a@)-1, 


ay + An—1 = (43 — 1) + (Qn—-2 + 1) = 43 + Gn—2 
etc. Therefore, letting S = a; + az +--+ + a,, we have 


2S = 2(ay + a2 + 43 + +++ + Ana + An-1 + Gn) 
= (ay + Gy) + (Gz + Gn—1) + (43 + Gn—2) + +++ + (Qn + 41) 
= (a) + ay) + (Qi + Qn) + (Qi + Qn) +07 + (Qi +) 
ae 
n summands 


= n(a, + a), 


and the formula follows. 
oO 


The formulas of items (a) and (b) of the previous proposition are respectively 
known as the formulas for the general term and for the sum of the first n terms of 
an AP. Let us see two simple applications of them. 


Example 3.7 Compute, in terms of n, the sums of the first n positive integers and 
the first n odd positive integers. 


Solution The positive integers form the AP 1,2,3,4,..., of common difference 1. 
Since its n-th term is obviously equal to n, item (b) of the previous proposition gives 


n(n + 1) 


Loe 
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The odd positive integers form the AP 1,3,5,7,..., of common difference 2. Its 
n-th term (i.e., the n-th odd positive integer) is, by the formula for the general term 
of an AP, equal to 1 + (n— 1)-2 = 2n — 1. Thus, again by item (b) of the previous 
proposition, the sum of the first n odd positive integers is 


A+ Ga] 2 
= 5 =n. 
Example 3.8 Let (ax)x>1 be the sequence given by a; = | and 


LSS e+ n= 1) 


ak 


ak41 = 10a 


for every integer k > 1. Compute a, as a function of k. 


Solution It is clear that all terms of the sequence (ax)x>1 are positive, so that we 
can define the sequence (b,)x>1 by letting by = me for every k > 1. Then, the 
recurrence for (ax)>1 gives us 


1 1+ 2a, 1 
Dey = Ss = — +2 = 42, 
Ak+1 ak ak 


so that (by)x>1 is the AP of initial term b; = a = | and common difference 
2. Therefore, this AP coincides with that of the odd positive integers, so that the 
previous example gives b, = 2k — 1, for every k > 1. Hence, 


1 1 


ae Tae 


oO 


Another useful class of sequences is the one of geometric progressions, according 
to the next definition. 


Definition 3.9 A sequence (a;),>1 of real numbers is a geometric progression (we 
shall abbreviate GP) if there exists a real number q such that 


Akt = + ak, (3.4) 


for every k > 1. 


The real number gq that appears in the definition of a GP is its common ratio. 
Observe that, if gq = 0, then a, = 0 for every k > 1. On the other hand, if g = 1, 
then a, = a; for every k > 1. As it happens with AP’s, a GP (ax)x>1 is completely 
determined provided we know its first term a; and its common ratio gq. 


Example 3.10 For a fixed real number q # 0, the sequence (a,)x>1 such that a, = 
gq‘ for every k > 1 (i.e., the sequence formed by the powers of g, with natural 
exponents) is a GP of common ratio q. If q < 0, then Problem 6, page 11, shows 
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that a, is positive if and only if k is even; if 0 < g < 1, then Corollary 1.3 gives 
a, >a, > a3>-:->0;ifg > 1, then, again by that result, we have 0 < a; < az < 
a3 <ce. 


Another useful recursive characterization of (almost all) GP’s is the one given by 
the following result. 


Proposition 3.11 A sequence (ax)x>1 of nonzero real numbers is a GP if and only if 
Op ode = G4; (3.5) 

for every k = 1. 

Proof By definition, (a;)x>1 is a GP (of common ratio q) if and only if 


a2 a3 a4 


— SS reid. 
a a2 a3 


In other words, (ax)x>1 is a GP if and only if for every integer k > 1, we have 
Ct? — “itl and this is an equivalent way of writing (3.5). o 


K+ a” 


As we have done with AP’s, our next result brings formulas for the general term 
and for the sum of the first k terms of a GP. 


Proposition 3.12 Jf (a,)x>1 is a GP of common ratio q, then: 


(a) ay =a, -g |, for everyk > 1. 
(b) Ifq #1, thena, +a, +-:-+a, = “tT for every n > 1. 


Proof 
(a) The proof we present parallels that for the general term of an AP (see the proof 


of item (a) of Proposition 3.6): diagram 


“q “q “q “q “q 
ay —— A 85 F7/e-1 | 


makes it clear that, in order to reach a, starting from aj, it is necessary k — 1 
steps, where each step consists of multiplying a term of the sequence by gq. 
Hence, to reach a, we have to multiply a; by g exactly k — | times, so that 
ar= a: go, 

(b) Letting S, = a; + a2 +++: + dp, it follows from (3.4) that 


qSn = g(a, + dg +++ + Gn-1 + An) 
= qa + Gaz +++" 4+ Gan-1 + Gan 
= 42 + 43 +++ + Gn + An+1- 
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Therefore, 


(q—V)Sn = Sn — Sn 
= (a2 $3 +--+ Gy 4+ Gn41) — (Qj a2 +--+ a) 
= (dz + 03 +++" Ga) + Gnd — i — (a2 + +++ + Ga) 


= 4n+1— 41, 


where, in the last passage, we cancelled out the summand a2 + a3 + +++ + dy. 
It now suffices to divide both members of the equality (¢ — I)S = an41 — ay 
byqg-l. 

oO 


Example 3.13 Let (a,)x>1 be an AP of natural numbers, with common difference 
r > 0, and (by)x>1 be a GP of nonzero real numbers, with common ratio g. Consider 
the sequence (c,)x>1, such that c, = ba, for every integer k > 1. Prove that (cx)x>1 
is a GP of common ratio q’. 


Proof It suffices to show that the ratio of any two consecutive terms of the sequence 
(cx)x>1 18 always equal to q’. For this, we use the formulas for the general terms of 
AP’s and GP’s, as well as the result of Problem 4, page 11: 


+11 
Ck+1 _ Days) = byq*+! 


— 7tkt1-% — Zr 
=a 74 = q. 
Ck ba, biq* : oO 


The purpose of the next example is to call the reader’s attention to the fact that 
the idea used in the proof of item (b) of Proposition 3.12 is as important as the actual 
result it collects. Hence, the reader should keep this idea in his/her mind. 


Example 3.14 Compute the value of the sum 
2-147-3412-37+17-33 +---4+ 497-3” 4+ 502-31, 


where, from left to right, the k-th summand equals the product of the k-th term of 
the AP 2,7,12,...,502 with the k-th term of the GP 1, 3, 32,...,3!. 


Solution Arguing exactly as in the proof of item (b) of Proposition 3.12, let S 
denote the desired sum, and compute 3S (we use the factor 3, for it is the common 
ratio of the GP 1,3, 37,...,3!): 


98 = 9.34. 7.39? + 12-37 +.17-3* 4+-+4 407.391 4502.3", 
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Hence, 


28 = 3S—S 
=2-347-37 412-37 417-34 +--+ 4+ 497-31 4 502. 310 
= 9-178 = 1 = 19 a 7 3 — S 

= (502-3! — 2) —5(3 +37 + 3° +34 +4--- +31") 


= (502-310! — 2) — Git 3), 


where we grouped multiples of equal powers of 3 in the third equality, and used the 
formula of item (b) of Proposition 3.12 in the fourth one. Finally, grouping together 
the summands of the last expression above, we get S = +(999 3101 4 11). oO 


Problems: Section 3.1 


1. Compute the first four terms of the sequence (a,,),>1, defined by a; = 1| and 
a+ = ay, +a,+1,fork > 1. 
2. Write a positional formula for each of the following sequences: 


(ay G25, 24567 8) 


(b) (5.353 50517989 98°): 
fey (14,345,592, Fass) 


ue 
AL 
n 
an 
x 
oo 
— 


3. Let (dn)n>1 be a ae of positive real numbers satisfying the recurrence 
relation ay4, = aerT <j, fork > 1. Find a recurrence relation satisfied by the 
sequence (b,),>1, defined for n > 1 by by, = 


4. Write a recurrence relation satisfied for each of the following sequences: 
(a) C1, 1,1,3,5, 9, 17,31, 57, 105, 183,...). 
(b) (1,2, 22,2”, 2?”,...). 
5. Below, we show the first four lines of an infinite array of natural numbers, such 


that, for i > 1, the i-th line starts with the number i and has two more entries 
than the (i — 1)-th line. Compute the sum of the entries in the n-th line. 


RWN 
aN 
nN 
an 
—~ 


3.1 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
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. Show that the number 11... 1 (m digits 1) is equal to — 
. Compute the sum 1 + 11+ 111 +---+11...1 as a function of n. 


n 


. If (@,)g>1 is an AP with common difference r, prove that the sequence (bz)x>1, 


defined by by = ag,, — ag for every k > 1, is also an AP, and compute its 
common difference. 


. Let (a)x¢>1 be an AP of nonzero common difference, and p, q, u, v be given 


naturals. Prove that 
Ay + ag =A +d eo ptqaeutov 


Let (ax)z>1 be an AP of common difference r ¥ 0. If = is a nonnegative integer, 
prove that the sum of any two terms of the AP is also a term of it. 

Let (a;)x>1 be an AP such that a, = a and a, = B, with p ¢ gq. Compute ap+, 
in terms of p, q, a, B. 

(Romania) The sum of some (more than one) consecutive odd integers 
equals 7°. Find these numbers. 

The AP (ax)x>1 is formed by pairwise distinct naturals. Prove that it contains 
infinitely many composite naturals” among its terms. 

(Canada) Let a, be the sum of the first n terms of the sequence 


(0,1,1,2, 23:34 4h + 14 5. 


(a) Find a formula for a, as a function of n. 
(b) Prove that @m4n— Qn—n = mn for every natural numbers m and n, with 
m> n. 


(Canada) Show that the numbers V2, V3 and /5 cannot be terms of a 
single AP. 

Let (ax)x>1 be a GP with common ratio q. Prove that, for an arbitrary integer 
n> 1, we have 


n(n—1) 


a\a2...d, =ajq 2 


Compute the value of the sum 


in which the k-th summand from left to right equals the quotient between the 
k-th term of the AP 1,3,5,..., 99 and the k-th term of the GP 2, 27, 23,... ,2°9. 


According to the introduction to Chap. 1, we recall that a natural n > 1 is composite if we can 
write n = ab, for some naturals a, b > 1. 
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18. The sequence (a,)x>1 is an arithmetic-geometric progression if, for every 
integer k > 1, we have a, = bycy, where the sequences (b,)y>1 and (cx)e>1 
are respectively an AP and a GP. Let r be the common difference of the AP and 
q be the common ratio of the GP. Compute, as a function of n, b;, c,, r and q, 
the value of the sum of the first n terms of (a;)x>1. 

19. (Macedonia) In a nonconstant AP of real numbers, the quotient between the 
first term and the common difference is an irrational number. Prove that no 
three distinct terms of this AP form a GP. 

20. Compute, as a function of n, the n-th term of the sequence (a,)x>1, given by 
a, = 2 and ag41 = 2a, — 1, for every integer k > 1. 

21. The sequence (d,)n>1 satisfies aj = 1 and ag41 = 3a, — 1, for every k > 1. Do 
the following items: 


(a) If by = an — 5, prove that by41; = 3b; for every k = 1. 

(b) Write down the first five terms of the sequence (b;,),>1 and, then obtain a 
general positional formula for it. 

(c) Find a positional formula for ay. 


22. Prove that there does not exist a GP having the numbers 2, 3 and 5 as three of 
its terms. 
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At the end of last section, we saw that a sequence (a,),>1 is a GP if and only if it 
satisfies a recurrence relation of the form (3.4), i.e., if and only if 


+1 — gaz = 0, (3.6) 


for every integer k > 1, where g is a real constant. For reasons that will be clear in 
a few moments, we say that (3.6) is a first order linear recurrence relation with 
constant coefficients. There, we also saw that a sequence (a,,),>, is an AP if and 
only if it satisfies a recurrence relation like (3.3). 

In this section, our task is to study the more general class of sequences (a,),>1 
that satisfy recurrence relations like 


Akt2 + rag) + Sax = 0, (3.7) 


where r and s are given real constants, not both zero. If s = 0, (3.7) essentially 
reduces to (3.6); if s 4 0, it is said to be a second order linear recurrence relation 
with constant coefficients. 

Problem 5 explains what we mean by the property of linearity of sequences that 
satisfy recurrence relations like (3.6) and, more generally, (3.7). For such a sequence 
(@n)n>1, Theorem 3.16 will explain how to compute a, as a function of n. Before we 
present it, it is instructive to show the idea behind its proof in a concrete example, 
and we start by doing this. 
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Example 3.15 Let (@y)y>1 be the sequence such that a; = 1, ag = 7 and ay. = 
8ax41 — 15a,, for every integer k > |. Find a positional formula for a,,. 


Solution For every integer k > 1, we have 
Antz — 3ag41 = 541 — 3ay). 


Thus, the sequence (b,)x>1, defined by by = az41 — 3a, for every k > 1, is a GP of 
common ratio 5 and initial term b} = a7—3a, = 4. Hence, by = b,-5*~! = 4.5*-!. 
Analogously, for every integer k > 1, we have 


O42 — Sagi = 3(ax41 — Sax), 


so that the sequence (cx)x>1, given by cg = ae41 — Sax for every k > 1, is a GP 
of common ratio 3. Since its initial term is c} = dy — 5a; = 2, it follows that 
Ck =C\°- 3*-1 = 2. 3K1, 

Therefore, for every integer k > 1, we have the system of linear equations 


p41 — 3a,n = 4- 5k-1 
41 — 5a, = 2- a 


which easily gives 


geo art Vee 1, q 


The reasoning presented in the solution to the previous example can be easily 
generalized to deal with a general second order linear recurrence relation with 
constant coefficients. We do this now, referring the reader to Sect. 11.4 for a different 
approach. 


Theorem 3.16 Let (ay)n>1 be a sequence of real numbers such that 
+2 + rap+1 + Say = 0 


for every integer k = 1, where r and s are given real constants, not both zero. If the 
second degree equation x? + rx +s = 0 has real roots? a and B, then there exist real 
constants A and B, completely determined by the values of a, and ap, such that: 


(a) Ifa # B, then a, = Aa"! + BB", for every integer n > 1. 
(b) Ifa = B, thena, = (A+ B(n— 1))a""!, for every integer n > 1. 


Proof Recall from Proposition 2.16 that a + 6 = —rand af = s. Thus, (3.7) can 
be rewritten as 


Ay — (a + B)arri + (aB)axg = 0 


The case of complex roots is dealt with, in a much more general setting, in Chap. 21 of [5]. 
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or, which is the same, as 


Ak+2 — Mdag+1 = B(ae+1 — ax) 


or 


ag42 — Bary = &(ag41 — Bax), 


for every integer k > 1. 

Letting by = ay41 —@a, and cy = a+ 1 — Bag, it follows from the above relations 
that (b,)x>1 and (cx)x>1 are GP’s of common ratios respectively equal to 6 and a, 
and initial terms respectively equal to b} = ay — wa, and c; = a2 — Ba). Therefore, 
the formula for the general term of a GP gives 


by = (@2 — aa,) pe" and cy = (a2 — Ba,)o*! 


or, which is the same, 


A+ — Ady = (ay — aa) BX! 


Ak+1 — Bag = (a2 — Bay)ak! * (3.8) 


Now, let us first consider the case a # B. In (3.8), subtracting the first relation 
from the second gives 


az — Bay k-1 az — adj, 


k-1 
a, = : 
k= 3 7p 
letting 
A ay — Bay See a» en a 
a—p a—p 


we get the formula of item (a). 

In case a = f, relations (3.8) are equal and, at a first glance, it seems that we do 
not have enough information to compute a;. Nevertheless, we can use the following 
trick: it is immediate to verify that the sequences (ux),>1 and (vz)k>1, given by 
u;, = ot! and vg = (k — 1)a*! (the same a as before, root of the second degree 
equation x* + rx +s = 0), are such that 


Ug4+2 + U4) + SU = 0 
Vet2 + rvpsy + sup, = 0 


for every integer k > 1 (see Problem 4); therefore, for fixed A, B € R, the sequence 
Z, = Aux + Bu; is also such that 


Ze+2 + Ze+1 + SZ =O 
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(see Problem 5). Thus, the idea is to search for real numbers A and B such that, for 
every integer k > 1, we have a, = z. Since both sequences (ax)g>1 and (Zx)x>1 
satisfy identical linear second order recurrence relations, Problem 13, page 96, will 
assure that it suffices to find real numbers A e B for which a, = z; and a2 = Zp, 1.e., 
such that 


qQ= A 
a2 = (A+ B)a , 
This can obviously be done, fora = —5 # 0. oO 


In practice, we employ the formulas of the previous result in the following way: 
let (A;)n>1 be a sequence of real numbers satisfying the recurrence relation (3.7) for 
every integer k > 1, where r and s are given real constants, with r 4 0. We start by 
computing the real roots (if any) of the second degree equation 


xr+m+s=0, 


which is called the characteristic equation of (3.7). If the characteristic equation 
actually has real roots w and f, we then check whether a 4 f ora = f, and use the 
formulas of items (a) or (b) of Theorem 3.16, according to the case at hand. In order 
to find the real constants A and B in case a # B, we solve the system of equations 


a,=A+B . 
dy = Aa + BB’ 


in case a = B, we solve the system 


aq=A 
az = (A+ B)a : 


In both of these cases, the systems we have to solve are obtained by taking k 
respectively equal to 1 and 2 in the formulas of items (a) or (b) of the theorem. 


Example 3.17 Let us execute the above procedure to get a positional formula for the 
n-th term of the celebrated Fibonacci sequence,’ i.c., the sequence (F,,),>1 given 
by F, = 1, F2 = 1 and 


Frya2 = Froit Fr, (3.9) 


4 After Leonardo di Pisa, also known as Fibonacci, Italian mathematician of the XII and XIII 
centuries. As can be seen in Chap.1 of [5], the Fibonacci sequence plays a relevant role in 
Combinatorics. 
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for every integer k > 1. More precisely, let us show that 


(44) - (=) | ~Wn>1. (3.10) 


Solution The recurrence relation (3.9) can be written as Fy42—Fy41 —F, = 0 and, 


F, = —= 


“5 


hence, has characteristic equation x* —x— 1 = 0. Letting a = bias and 6 = ie 
be its roots, it follows from item (a) of Theorem 3.16 that 


F,, = Aq”! + Bp", 


with A and B chosen so that F; = F> = 1. 
Making n = 1| and n = 2 in the above formula for F,,, we get the linear system 


A+B=1 
aA+ PB=1— 


Multiplying the first equation by a and subtracting the second equation from the 
result, we obtain (a— 8)B = a—1. Now, sincea +f = landa—B = V5, we can 
write this last equation as /5B = —f, so that B = =FE Analogously, multiplying 
the first equation by # and subtracting the result from the second equation, we obtain 


A= a) so that 


a n—-1 B n-1 1 n n 
a! — =. ph! = 0" - B"). 
V5 V5 V5 o 
Our next example shows a situation where Theorem 3.16 can be applied, albeit 
not straightforwardly. 


F, = 


Example 3.18 Let (dy)n>1 be a sequence of real numbers satisfying the recurrence 
relation ay4, = ray + s for every integer k > 1, where r and s are given real 
constants, with r # 0,1 (we need not consider the case r = 1, for in this case 
(@n)n>1 1s an AP). For an integer k > 1, we have 


Ak+2 — TAg+1 = S = Ag+1 — rag 
or, which is the same, 
Ont2 — (r+ Waggs + ray = 0. 


This is a second order linear recurrence relation with constant coefficients, whose 
characteristic equation is 


xv —(r+)xt+r=0. 
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Since its roots are r and 1, it follows from item (a) of Theorem 3.16 that a, = 
Ar"—! + B. If we now recall that a7 = ra, +s, we see that A and B can be found by 
solving the linear system 


A+B=a, 
Ar+B=rajt+s" 


Therefore, A = a, and B = s, so that 
Qh =ar’ +s. 


Let us now turn our attention to third order linear recurrence relations with 
constant coefficients. As the reader has probably guessed by now, in this case we 
have a sequence (a,,),>1 satisfying a recurrence relation of the form 


Ak+3 + raz + Sax41 + tay = 0 


for every integer k > 1, where r, s and ¢ are given real constants, not all zero. 

As in the case of second order linear recurrence relations with constant coef- 
ficients, we define the characteristic equation of (3.11) as the third degree 
polynomial equation 


etretsxtt=0. 


According to Problem 21, page 43, such an equation admits, at most, three real 
roots. In what follows, we shall assume that it indeed has three such roots, let us say 
a, B and y. Then, we have the following result. 


Theorem 3.19 Let (a,)n>1 be a sequence of real numbers such that 


O43 + ray. + Sapo) + tay = 0 (3.11) 


for every integer k = 1, where r, s and t are given real constants, not all zero. If the 
characteristic equation x° + rx* + sx + t = 0 of (3.11) has real roots a, B and y, 
then there exist real constants A, B and C, completely determined by the values of 
a, a2 and a3, such that: 


(a) fa#BAy #4, thena, = Aa! + BB"! + Cy""|, for everyn> 1. 


(b) Ifa = B #y, then a, = (A + B(n—1))a""! + Cy""|, for everyn > 1. 
(c) Ifa = B =y, thenay, = (A+ B(n— 1) + C(n—1)?)a"", for everyn > 1. 


Proof We shall only sketch the proof, leaving the details to the reader (see 
Problem 6). 

Let (b,)n>1 be the sequence given by b, = a!) for every integer n > 1. Since 
a? + ra? + sa + t = 0, we also have aft? + rokt! + sak + ta*—! = 0 or, which 
is the same, 


bee3 + rbpy2 + shez + thy = 0, 
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for every integer k > 1. Hence, (b,),>1 satisfies the same recurrence relation as 
(4n)n>1 does. Analogously, the sequences (c,),>1 and (d,)n>1, given for an integer 
n> 1byc, = B"! andd, = y""!, also satisfy the same recurrence relation as 
(Qn)n>1- 

We now turn our attention to the cases (a)-(c): 


(a) For all A,B,C € R, the sequence (u,),>, such that u, = Aa’! + BB"! + 
Cy"! for every integer n > 1 satisfies the same recurrence relation as (4y)n>1 
does. On the other hand, since a # 6 # y 4 a, we can choose the constants 
A, B and C in such a way that uw) = aj, uz = a2 and u3 = ax. Therefore, 
by invoking once more the result of Problem 13, page 96, we conclude that 
Un = Gn, for every integer n > 1. 

(b) The result of Problem 21, page 43, gives us 


P+retsxtt=(«—ay(—-y). 


With the aid of this identity, it is easy to show that b, = (n — 1)a"! also 
satisfies the same recurrence relation as the sequence (a,),>1. Then, as in item 
(a), the sequence (u,,),>1 Such that u, = (A+ B(n— 1))a"! +Cy""! forn > 1 
satisfies that same recurrence relation, for all A,B,C € R. Moreover, we can 
again choose the real constants A, B and C such that vu; = a), U2 = a2 and 
u3 = a3. Thus, once more by the result of Problem 13, page 96, we conclude 
that u, = a, for every integer n > 1. 
(c) This time, the result of Problem 21, page 43, gives 


etrnet+sxt+t=(x«—-a)’. 


It is now easy to show that both sequences b, = (n — 1)o”! and c, = (n— 
1a"! do satisfy the same recurrence relation as the sequence (a;,)n>1. Hence, 
the same is true for the sequence u, = (A+ B(n— 1) + C(n— 1)*)a"“!, for all 
A,B,C € R. The rest of the argument proceeds exactly as in items (a) and (b). 
oO 


Example 3.20 Let (ay)n>1 be the sequence of real numbers given by a; = 1, az = 
4,a3 = 14and 


An+3 — OAg42 + 12axg41 — 8ax = 0, 


for every integer k > 1. Compute a, as a function of n. 


Solution The characteristic equation of the given recurrence relation is x° — 6x” + 
12x — 8 = 0. Since x — 6x + 12x — 8 = (x — 2)?, item (c) of the previous result 
gives 


a, = (A+ Brn—1) + CQ2— 1)’)- 271, 
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for every integer n > 1, where A, B and C are real constants chosen so that a; = 1, 
az = 4 and a3 = 14. These initial conditions, in turn, give us the linear system of 
equations 


A=1 
A+B4+C=2_, 
A+2B+4C=7/2 


whose solution is A = 1, B = 3/4, C = 1/4. Then, we obtain 


= 3 = sl — 4)2) 2.9271 
a = (14 30 n+l ?)-2 


= (n? +n+2)-23, 


Problems: Section 3.2 


1. Let (an)n>1 be the sequence given by a; = 1, az = 4 and ag42 = S5ag41 — 6a, 
for every integer k > 1. Compute a, as a function of n. 

2. Let (d,)n>1 be the sequence given by a) = 3, a2 = 5 and ago = 3ag41 — 2aK, 
for every integer k > 1. Prove that a, = 2” + 1, foreveryn EN. 

3. Let (dy)n>1 be the sequence given by a; = 3 and ag4; = 2ax — 1, for every 
integer k > 1. Compute a, as a function of n. 

4. * Tf the second degree equation x* + rx + s = 0 has two real roots equal to a, 
prove that the sequences uz, = a*~! and % = (k — 1)a*! satisfy the recurrence 
relations 


Ugt2 + rup41 + Suz = 0 
Vet? + VE] + SUZ = O 


(3.12) 


5. * The sequences (u,)n>1 and (U,),>1 Satisfy the recurrence relations (3.12), for 
every integer k > 1. For any fixed real constants A and B, prove that the sequence 
(Zn)n>15 given by z, = Aug + Bug, satisfies an analogous recurrence relation. 

6. * Fulfill the details in the proof of Theorem 3.19. 

7. * The Lucas sequence’ is the sequence (Zn)n>1 given by L; = 1, Ly = 3 and 
Ly+2 = [p41 + Ly, for every integer k > 1. Show that, for every integer n > 1, 
we have: 


(a) L, = a" + 6", where a = eee and B = lew 
(b) Lon = L2 + 2(-1)"1. 


5 After Francois Edouard Anatole Lucas, French mathematician of the XIX century. 
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(c) L? — 5F* = 4(—-1)", where (F,,),>1 is the Fibonacci sequence (cf. Exam- 
ple 3.17). 


8. (IMO shortlist—adapted) With respect to the sequence (a,),>1, such that a; = 1 
and 


1 
Gn+1 = 76! + 4a, + Vv 1+ 24an) 


for every integer n > 1, do the following items: 


(a) If b,» = V1 + 24a,, show that 2b,4; = b, + 3 for every n > 1. 
(b) Show that b, = 3 + s for everyn > 1. 
(c) Conclude that, for every n > 1, we have 


3.3. The > and II Notations 


In this section, we introduce the notations )> (one reads sigma) for sums and [| 
(one reads pi) for products, which will reveal themselves quite useful in the context 
of sequences. 


Definition 3.21 Given a sequence (ax)x>1, we write jal a; to denote the sum a; + 


dz +-++++ Gp, and read it as the sum of the a;’s, from j = | to n. Hence, 


n 


Soa = a{ ,ifn=1 
ln a.t+a+---+a,,ifn>1- 


j=l 


As a particular case of the above definition, if (a,),>1 is a constant sequence 
with, say, a, = c for every k > 1, we clearly have 


n 


n 
) a= ) c=ne. 
j=l 


j=l 


The main reason for the success of the & notation owes to the fact that it turns it 
quite easy to manipulate sums with large numbers of summands, specially when 
each such summand is itself a sum. For example, given sequences (a,),> 1 and 
(by)x>1 Of real numbers, the associativity and commutativity of addition of reals 
give 


(a) +z +--+ +n) + (b) + bz +++++ bn) = (a) £b1) + (Qn £ bz) ++ +++ (Gh Eby); 
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with the aid of the ) notation, the equality above can be written in the much more 
compact form 


at aa) Ge So) (3.13) 
j=l j=l j=l 


On the other hand, given c € R, the distributivity of the multiplication of reals with 
respect to addition gives 


c(a + a2 + +++ + Gn) = cay + cay + +++ + COn, 


an equality that can be written, again with the aid of the & notation, as 


4 = ya (3.14) 
j=l j=l 


The following example shows, in a simple case, how one could apply the two 
identities above in order to simplify the task of calculating the value of a sum. 


Example 3.22: Compute the value of )*;_ (2k + 1) in terms of n € N. 


Solution Successively applying (3.13), (3.14) and the first part of Example 3.7, we 
get 


2k + 1) = 5 2k+ sa Egy eas 
k=1 k=1 k=1 k=1 


1 
a2. OED pnan tm, 


oO 


The )> notation is particularly useful to perform cancellations in sums. More 
precisely, given a sequence (ax)x>1, and performing the possible cancellations in 
the sum 


(ay — a1) + (a3 — an) + (4g — 3) ++ + (Qn—1 — Gn—2) + (Qn — Gn-1), 


we get a, — a) as result. With the aid of the }~ notation, we can write the equality 
just get as 


n—1| 


YiG@u — dj) = a, — da. (3.15) 


j=l 
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An equivalent formula (obtained from the above by writing n + 1 in place of 7), 
which will sometimes be used in the place of (3.15), is 


YG — aj) = 4n+1— a]. (3.16) 


j=l 


Any one of (3.15) or (3.16) is known as a telescoping sum. The idea behind 
the name is the following: as a telescope shortens the immense distance between 
a celestial body and our eyes, the above formulas shorten the way between certain 
given sums and their results. 

Telescoping sums are the greatest advantage of having the )> notation at our 
disposal. Let us justify this claim by examining two interesting examples. 


Example 3.23 Deduce the formula for the general term of an AP with the aid of 
telescoping sums. 


Solution Let the sequence (a;)x>1 be an AP of common difference r. Then, aj+1 — 
aj = r for each integer j > 1, so that (3.16) gives 


n—1 


n—-1 
a, — a, = ) (41 —@) = Dor=(@-Ir 
j=l 


j=l 


or, which is the same, a, = a, + (n— I)r. oO 


To the next example, we say that a sequence (ax)x>1 is a second order AP 
provided the sequence (bx)x>1, given fork > 1 by bye = ag4i — ag, is a 
nonconstant AP. In order to build a second order AP (ax)x>1, we can start with a 
nonconstant AP, as 


(3.7,11,15, 19, 94, 97. 3, 


and stipulate the initial term of the second order AP, say a; = 2; then, we 
successively compute the values of a2, a3,... from the relations a2 — a; = 3, 
a3 — ad, = 7, a4 — a3 = 11 etc. Proceeding this way we obtain, from the AP 
given above, the second order AP 


(2,5, 12, 23, 38,57, 80,...). 


The discussion of the previous paragraph makes it clear that a second order AP 
is completely determined only if we know its first three terms. In fact, it is only by 
knowing its first three terms that we will know the first two terms of the nonconstant 
AP formed by the differences of the consecutive terms of the second order AP. 

We are now in position to use telescoping sums to deduce a formula for the 
general term of a general second order AP. 
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Example 3.24 Given a second order AP (a,)x>1, prove that 


(n—1)(n—2)r 


An = a, + (az —a))(n—1) + 5 


(3.17) 
where r = a3 — 2a + a, is the common difference of the nonconstant AP formed 
by the differences of the consecutive terms of (a;)x>1. 


Solution Let by = ax+1 — a for every integer k > 1. The formulas for telescoping 
sums and for the sum of the terms of a finite AP give us 


n—1 n—1 
(n— 1)() + bn-1) 
An — ay = YG — aj) = 7; = — +a 


j=l j=l 

On the other hand, by applying the formula for the general term of an AP, we get 
bp) = b) + (n—-2)r = (22 — aj) + (n— 2)r 

and, thus, 


(n— I + by-1) 

2 
_ (n— 1)(2(ay — a) + (n— 2)r) 
=a,t+ a ee — 


Aan =a, + 


=a, + (n—1)(ax—a1) + — 


Finally, it suffices to observe that 


r= bp — by = (a3 — a2) — (@ —Q) = a3 — 2a2 +. 4. 5 

In spite of the previous examples, if we wish to use the formula for telescoping 
sums to compute the sum es b; of the first n terms of a given sequence (b;)x>1, 
we first need to find a way to write the summands J; as the differences aj) — aj, 
between consecutive terms of some other sequence (a,)z>1. The difficulty with this 
reasoning relies upon the fact that we do not know, beforehand, who the sequence 
(ax)x>1 18. The discussion of some additional examples will help to clarify this point. 


Example 3.25. Consider the sequence (ax)x>1 given by a, = 1 and 


ak 


a = — , 
a 1+ ka 


for every integer k > 1. Compute a, as a function of n. 
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Solution Since the terms of the sequence (a;),>, are all positive (why?), we can 
define the sequence (b;);,>1 by setting by = z The given recurrence relation, then, 
gives, 


1 1ltka 1 
= kam th. 
aK 


Dep = = 
Ak+1 A 


Thus, by applying the formula for telescoping sums, together with the formula for 
the sum of the terms of a finite AP, we get 


by — by = (hs by) = aes — 


Therefore, 


Pi catip gt Seb) 2g 20-9) _ nw —nt+2 


2 9p 


and, hence, a, = oO 


oR 2 
bn ~ n2—n+2° 

For the next example define, for each positive integer n, the factorial of n, 
denoted n!, as the product of all natural numbers from | to n (by convention, 1! = 1), 
ie., 2! = 2,3! = 6,4! = 24,5! = 120 etc. Observe that, in general, we have 
(K+ 1)! = (kK4+1)-K!. 


Example 3.26 (Canada) Compute the value of the sum ea JG!) in terms of n. 


Solution In order to use the formula for telescoping sums, we have first to succeed 
in writing j(j!) as a difference aj, — aj, of consecutive terms of a single sequence 
(ax)x>1. We do this by observing that 


JY) =(1G+D-Wt=G+D--P= G4 Y!-j. 


Hence, by defining a, = k! fork > 1, we get 


SiG) = G+ D!-H) = Di G@i- 4) 
j=l j=l j=l 


=4Qrn41-—q =(n4+1)!- 1. oO 


As anticipated at the beginning of this section, we now present a very useful 
notation for products of real numbers. 


Definition 3.27 Given a sequence (a;)x>1, we write TTj- , 4; to denote the product 
a\a2...d,, and read the product of the a;’s, from j = 1 i j =n. Thus, 
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n . 

ay ,ifn=1 
ae ajd2...a,, ifn>1 
j= 


With the aid of the [[ notation, we can write the factorial of n € N (see the 
paragraph that precedes Example 3.26) by writing 


nl = Lf. (3.18) 


As was the case with the }~ notation, the usefulness of the [| notation comes 
from the fact that it formally commutes with multiplications and divisions. In fact, 
given a real number c and sequences (a,),>1 and (b,)x>1, we have 


(ay... dn) (bi bz... bn) = (a1b1) (G2b2) ... (Gnbn), 
a{a2...ayn a, a2 an 
bibz... by, b; bo by, 


and 
c"(ajaz...dn) = (Ca,)(caz) ... (Can) 


(provided all of the b;’s are nonzero, in the second equality above). Writing both 
sides of the identities above by means of the II notation, we obtain the relations 


IG [ [4 =| |[@p). 
j=l j=l j=l 


Tj=1 qj ; qj n : : 

vem b; = I] i and c Ila = [ [(ca). 
j=l j=l j=l 

Below, we present an example of application of such formulas. 

Example 3.28 Compute, in terms of n, the value of [];_, (2 + 2). 

Solution We apply the properties of the [| notation listed above: 


” 2 “  f{k+1 Merk + 
2 _ => 2 ———— = 9S SS 
M(2+z) = 12) rk 
k=1 k=1 
— yn @tD! _ 4.@+)-n! 
n! n! 


= 2"(n+ 1). 
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Analogously to the case of the )> notation, the [] notation is particularly useful 
when performing cancellations in certain products. This is the content of the formula 
for telescoping products, collected in the following result. 


Proposition 3.29 If (ax)x>1 is a sequence of nonzero real numbers, then 


ee sich (3.19) 


qj a\ 


Proof As with telescoping sums, it suffices to observe that the intermediate factors 
of the product at the left hand side all cancel. In symbols, 


n 
Gjt+1 _ 42 a3 a4 Gn Ant1 _ ntl 


i—| qj aq, a2 a3 an-1 an a 
I= oO 


Example 3.30 We take another look at the previous example, this time with the 
formula for telescoping products at our disposal. To this end, we first of all note that 


n 2 , n k+1 
Ne ee) 
k=1 k=1 


Now, defining the sequence (axz)x>1 by a, = k, it follows from (3.19) that 


te 2 fap An+1 
2+-—]=2" = 2) 1). 
II ( +7) IT ) (n+1) 


a a 
k=1 k 1 


Problems: Section 3.3 


1. Prove that a sequence (ax)x>1 is a second order AP if and only if ay42—2ay41+ 
ay ~ O and ag43 — 3ax42 + 34x41 — a, = 0, for every integer k > 1. 

2. Let (a%)x>1 be the sequence defined by a; = 1 and ay4+, = ay, + 3n — 1, for 
every positive integer n. Compute the n-th term of this sequence in terms of n. 

3. The sequence (a,),>1 is given by a} = 1 and ady41 = a, + 8n, forn > 1. 
Compute a,, in terms of n. 

4. * Compute the value of the sum }°7_, ca as a function of n € N. 

5. The sequence (a,)x>1 is an AP. Prove that, for every positive integer n, we have 

n—1 


1 n—1 


k=l Agak+1 a\an 


3.3 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
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. Prove that, for every positive integer n, we have 


ee eee 1 1 
stotatots<2--. 
n n 


. Compute the value of the sum }77_ : GE=DGED as a function of n € N. 
. (Romania) Let k and n be positive integers. Prove that: 


(a) (2k+1)3—(2k—1)? can always be written as a sum of three perfect squares. 
(b) (2n + 1)? — 2 can always be written as a sum of 3n — 1 perfect squares. 


. Compute the value of the sum }°7_, ean as a function of the integern > 1. 
. (Brazil—adapted) Do the following items: 


(a) Fork € N, write (k + 1)? +k? + k?(k + 1) asa perfect square. 
(b) Compute the value of the sum par 4/ z + ar +1. 


* (Brazil) Let (F,)z>1 be the Fibonacci sequence, i.e., the sequence given by 
F, = 1, Fy = Land Fyy2 = Fei + Fr, for every integer k > 1. Compute the 
value of the sum )°7_, mts in terms of n € N. 

The sequence (ax)x>1 is an AP. Prove that, for every positive integer n, we have 


-1 
< n—1 


L Tat ia Vat Jan 


; 


(a) Factorise the expression xx +41. 
(b) Compute the value of the sum ene mort in terms ofn EN. 
(Australia) Compute the value of the sum 


999,999 
2 V+ D2 + Wan 


Compute, in terms of n € N, the value of the sum 


a 
at kh + DVR + RVR AT 


(Germany) Given a natural number n > 1, compute, as a function of n, the value 
of the product | ]j_, (1 - #) 


For 0 < k < 101, letx, = Compute the value of the sum 


ce 


101 3 
Xk 


= 1 — 3x, + 3x2 
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18. 


19. 


20. 


21. 
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(Leningrad) Compute, as a function of n € N, the value of the product 


For the next problem, we recall that the arithmetic mean A of a finite 
collection a), a2, ..., a, of real numbers is the real number 
a tant +H 
r ; 


A= 


(Macedonia) Find all values of n € N for which we can write the set A = 
{1,2,3,..., 4} as the union of n pairwise disjoint 4-element subsets, such that, 
in each of them, one element is the arithmetic mean of the other three. 
(China—adapted) Use the result of Problem 13, page 26, to compute the value 
of the greatest integer that is less than or equal to 


1 1 1 1 
S=—~4+——~4+ +--+ ——. 
JT af2-. «3 ./10,000 


* Given positive integers n and p, prove that: 


(k+ 1th pti 


(a) KP < oa < (k + 1)?, for every k EN. 
yeti 


bye <o < eae 


Chapter 4 
Induction and the Binomial Formula 


With the algebraic background of the previous chapters at our disposal, we devote 
the first section of this chapter to the development of a very important topic in 
elementary Mathematics, namely, the principle of mathematical induction. It will 
considerably improve our ability of elaborating proofs of mathematical statements 
that depend on natural numbers. In the other two sections, we shall apply this 
principle to deduce Newton’s formula for binomial expansion, as well as some 
important related results. 


4.1 The Principle of Mathematical Induction 


Generally speaking, there are several ways of proving something. For instance, 
we can make a direct proof, or a proof by contradiction (for more on Logic and 
proof techniques, see the first chapters of [21]). In this sense, the principle of 
mathematical induction will provide us yet another way of proving certain types 
of mathematical statements. 

To understand how this principle works, let’s consider a subset A C N such that 
1 € A. Also, suppose we know that, whenever a certain natural number k is in A, 
then, sok+ 1 also is in A. Therefore, 1 € A assures that 2 = 1+ 1 € A; accordingly, 
2 € A allows us to conclude that 3 = 2 + 1 € A. Continuing this way, we conclude 
that A contains all natural numbers, or, which is the same, that A = N. This intuitive 
discussion can be formalized as the following axiom, known as the principle of 
mathematical induction. 


Axiom 4.1 Let A CN be a set satisfying the following conditions: 


(a) 1EA. 
(b) KEASK+1EA. 


ThenA =N. 
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At this point, a natural question is: how can we apply the principle of mathemat- 
ical induction to prove something in Mathematics? In order to answer it, suppose 
there is given a property P(n) of the natural number n, which we wish to prove to 
be true for every n € N. Then, we let 


A= {k €N; P(&) is true} 
and observe that 
A=N S$ (P(n) is true for every n € N). 


Thus, in order to prove that P(n) is true for every n € N, it suffices to prove that 
A = N; yet in another way, by invoking the principle of mathematical induction, it 
suffices to prove that: 


e LEA; 
e* kKEADSK+1 EA. 


In turn, the definition of the set A assures that, showing the validity of the two items 
above is the same as showing that 


¢ P(1) is true; 
¢ P(k) true => P(k + 1) true. 


The previous discussion can be summarized as the following recipe for a proof 
by induction. 


Proposition 4.2 If P(n) is a property of the natural number n, then P(n) is true for 
every n € N ifand only if the following two conditions are satisfied: 
(a) P(A) is true; 
(b) P(k) true => P(k + 1) true. 
To understand how a proof by induction works in practice, let’s start by 


discussing two examples, the first of which was already considered in the previous 
chapter. 


Example 4.3 Prove that, for every n € N, the sum of the first n odd natural numbers 
equals n?. 


Proof Since the k—th odd natural number is 2k—1, the property P(n) is, in this case, 
the same as 


Pn): SOQj-) =n’. 
j=l 


As was said before, in order to prove by induction that P(n) is true for every n € N, 
we have to verify that: 
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i. P(A) is true. 
ii. P(k) true => P(k + 1) true. 
Checking i. is immediate: the first odd natural number is 1, which is the same 


as 1”. In order to verify ii., we have to assume that P(k) is true, i.e., that 


k 


YQi-D=k, (4.1) 


j=l 
and find a way to deduce from this that P(k + 1) is also true, i.e., that 


k+l 


YQ@- VD = (kt Y’. (4.2) 


j=l 


Since we are assuming that P(k) is true, we can substitute (4.1) into the left hand 
side of (4.2) to get 


k+1 k 
YQ@-) = VO@-1) + Akt) =K + A+) = (K+ 1). 
j=l j=l 
Hence, it follows by induction that P(7) is true for every n € N. Oo 


Example 4.4 Prove that, for every n € N, the sum of the first n perfect squares is 
equal to 


ann + 1)Qn+ 1). 


Proof Since the k—th perfect square is the number k?, the property P(n) is, in this 
case, 


Pn): >= ann + 1)(2n + 1). 
j=l 


As before, in order to compose a proof by induction, we have to verify that: 
i. P(1) is true. 
ii. P(k) true > P(k + 1) true. 
Again, verifying i. is immediate: 17 = Mie piet ey) To verify ii. let’s assume that 


P(k) is actually true, i.e., that 


k 
Yi7 = she + DOK+D, (4.3) 


j=l 
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and find a way to deduce that P(k + 1) is also true, 1.e., that 


k+1 
D7 = F&F K+ + LEH +1 (4.4) 


j=l 


Substituting (4.3) into the left hand side of (4.4), we obtain 


k 
YP =P +k+D? 


j=l j=l 


ake I(2k + 1) + (+1)? 


atk + 1[k(2k + 1) + 6(k + 1)] 
= atk + 1)(k + 2)(2k + 3). 


Hence, it follows by induction that P(n) is true for every n € N. Oo 


Below, we state (also as an axiom) a slightly more general form of the principle 
of mathematical induction, which gives it greater flexibility in applications. 


Axiom 4.5 Leta € NandA C {a,a+1,a+2,...} bea set satisfying the following 
conditions: 


(a) aéA. 
(b) KEASK+IEA. 


Then, A = {a,a+1,a+2,...}. 


As the reader may be guessing right now, the prototype application of this form 
of the principle of mathematical induction is to prove that a certain property P(7) is 
true for every natural number n > a, where a is a natural number given in advance. 
This can be done by letting 


A= {k EN; P(k) is true} 
and observing that 
A= {a,a+1,a+2,...} 
t 


P(n) is true for every natural number n > a. 


This way, we obtain the following more general recipe for proofs by induction. 
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Proposition 4.6 Leta € N. If P(n) is a property of the natural number n, then P(n) 
is true for every natural number n = a if and only if the following two conditions 
are satisfied: 


(a) P(a) is true; 
(b) P(k) true => P(k + 1) true. 


The next example illustrates the fact that, sometimes, this more general form of 
proof by induction is really necessary. 


Example 4.7 Prove that n! > 2”, for every natural number n > 4. 


Proof First of all, observe that we really need to start from n = 4, for the inequality 
n! > 2” is not true when n = 1, 2 or 3. 
The property P(n) we wish to prove is 


P(n): n!> 2”. 


In order to prove it by means of mathematical induction, we have to prove that 
P(4) is true and that P(k) true => P(k + 1) true. The validity of P(4) follows from 
4! = 24 > 16 = 2+. Let’s then suppose that, for some k € N, the property P(k) is 
actually true, i.e., that 


kl > 2 


From this, we wish to deduce the trueness of P(k + 1), i-e., that (k + 1)! > 2**1. 
To this end, we first use the trueness of P(k) to get 


(K+ 1)!=(k4+1)-k> (K+ 1-2; 
subsequently, we note that 
(c+ 1-2? eo" 


for every integer k > 1. Hence, by combining the two inequalities got above, we 
arrive at (k + 1)! > 2**!, thus concluding that P(k + 1) is indeed true. 

Therefore, it follows by induction that the property P(n) is true for every integer 
n> 4, oO 


Before we present another example, it is appropriate to make some comments 
on terminology: in a proof by induction, the step P(k) > P(k + 1) is generally 
called the induction step. In order to execute it, we assume that the property P(k) 
is indeed true (and this assumption constitutes our induction hypothesis) and, 
then, use this trueness, possibly together with other arguments, to deduce that the 
property P(k + 1) is also true. Thus, a proof by mathematical induction, as stated in 
Proposition 4.6, is usually done by executing the following steps: 
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¢ identification: isolate the property P(n) to be proved; 

* initial case: verify the validity of P(a); 

¢ induction hypothesis: assume the trueness of P(k). 

¢ induction step: deduce the validity of P(k + 1), with the aid of the induction 
hypothesis and (perhaps) other arguments. 


The statement of each problem generally makes it quite clear what the property 
P(n) is. Therefore, a proof by induction is usually centered around the three last 
steps of the above scheme. Moreover, it is also customary not to explicitly state 
either the property P(k) or the induction step P(k) = P(k + 1). The discussion of 
the next example is performed in this shortened way. 


Example 4.8 (Brazil) For every integer n > 2, show that there exist n pairwise 
distinct natural numbers whose sum of inverses equals 1. 


Proof Let’s make induction on n > 3, the initial case n = 3 following from 


Now suppose, by induction hypothesis, that, for a certain natural k > 3, there 
exist natural numbers x; < x. < +++ < x, such that 


1 1 1 
—+—4--+—-=1. 
XxX] X2 Xk 


Multiplying both sides of the equality above by ; and adding ; to both sides of 
the resulting equality, we obtain 


: ele : te : ht eok a 1 

3 Dee Dp xe 
Since 2 < 2x, < 2x. < +++ < 2x,, we have got k + 1 pairwise distinct 
natural numbers whose sum of inverses is also equal to 1, and this completes the 
induction step. Oo 


When applying the principle of mathematical induction to prove something, 
one must be very careful in the execution of the induction step, to avoid absurd 
conclusions. This point is better illustrated with the following classical example and 
the subsequent discussion on the reasoning we shall present. 


Example 4.9 Ina farm, if one horse is white, show that all horses are white. 


Proof Let’s “prove” the statement above by induction on the number n of horses. 
To the initial step, note that if a farm has just one horse and at least one of its 
horses is white, then surely all horses in the farm are white. 
By induction hypothesis, assume that in a farm with k horses, if at least one of 
them is white, then all are white. Then, consider a farm with k + 1 horses, also with 
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at least a white one. Take one horse from the farm, one that is not that we knew a 
priori to be white. Since we are left with k horses in the farm, at least one of which 
is white, it follows from the induction hypothesis that all of the k horses are white. 
Now, bring back the horse who was taken from the farm, and take from it one that 
was left at the first time. Again, we are left with k horses, at least one of which is 
white, so that, applying the induction hypothesis one more time, we conclude that 
all of these k horses are also white. However, since the horse we took from the 
farm this second time was already white, we conclude that all of the k + 1 horses 
are white. oO 


It is quite evident that there is some absurd in the “proof” presented above, for the 
statement of the example does not reflect the reality. The point is that we did not 
succeed in executing the induction step, for our reasoning does not work in a farm 
with two horses, as you can readily verify. 

There is yet another important form of the principle of mathematical induction, 
generally referred to as the strong principle of mathematical induction, or simply 
as strong induction. We describe it next. 


Axiom 4.10 Let A C N be a set satisfying the following conditions: 


(a) 1EA. 
(b) {1,...,.Kh} CAS kK+1€A. 
Then, A = N. 


At this point, the reader should probably find it clear how to use of the strong 
principle of induction to compose proofs. Nevertheless, in order to further illustrate 
it, we take a look at a couple of examples. 


Example 4.11 (OCS - adapted) Show that, for every n € N, the number (7 + 
4/3)" + (7 — 4/3)” is a positive even integer. 


Proof Letu = 7+4,/3 andv = 7—4V/3. Then, ut+tv = 14anduv = 1, from where 
it follows that u and v are the roots of the second degree equation x” — 14x ++ 1 = 0. 
It follows from this that w2 = 14u— 1 and v? = 14v — 1, and hence, for every 
integer k > 2, 


uk = 14u'! — yk? and v* = 140*7! — v?, 


Thus, letting s; = w/ + v/ and adding both relations above, we get, for every integer 
k > 2, that 


Sh = 14s,-1 — Sr-2.- 


Now, observe that so = 2 ands; = u+v = 14, both integers. Suppose, by 
induction hypothesis, that s, € Z for every integer 1 < k < n. Then, the recurrence 
relation above gives 


Sy = 14s,—-1 — Sn—2, 


from where we conclude that s,, being the sum of two integers, is also integer. 
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To what was left to prove, note that u,v > 0 guarantee that s, = u” + v” is 
positive for every n € N. On the other hand, again from the recurrence relation for 
the sequence (s;)4>1, we conclude that s, and s,—2 always have the same parity, i.e., 
they are either both even or both odd. However, since so and s; are both even, it 
follows again by induction that s,, is an even integer, for every natural n. Oo 


Example 4.12 Show that every natural number n can be written, in a unique way, 
as a sum of powers of 2 with nonnegative and pairwise distinct integer exponents. 
This way of writing n is called its binary representation or expansion. 


Proof Let’s make a proof by strong induction. For n = 1, we have 1 = 2°, and 
this is obviously the only possible binary representation of 1. Now, suppose that the 
desired result is true for every natural number less than a certain natural n > 1. 

We first show that n has at least one binary representation. To this end, take the 
greatest power of 2 which is less than or equal to n, say 2*. Then, 


9k <n< Dern 
so that0 < n—2* < 2* Ifn—2* = O, there is nothing left to do. Otherwise, 
1 < n—2* < n and, by induction hypothesis (here, we are assuming that every 
natural number less than 7 has a binary representation; therefore, we are making a 


strong induction argument), there exist nonnegative integers 0 < ay < aj <---< aq 
such that 


n—2 = 20 42 4. $2, 


Using once more the inequality n—2* < 2*, it follows that 2% +2% +---+2% < 2‘ 
and, thus, a; < k. Hence, 


n= 2 429 4... 4 2H 4 OK 


with 0 < ay < aj <--- <a) <k. 
We then show that the binary representation of n is unique. So, let’s assume that 


n= 2% 497 4...4.29 = 2% 49% 4... 4 DF 
with 0 < dg < a) <-+-<ajand0 < bo < by <--+ < by. Then, 


2 <2 49% 4...4 24 
=n= 242 4.2.42" 


Qt OY ben. OBI 


IA 


= 21], 
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where in the last passage we used the formula for the sum of the terms of a finite 
GP. It follows from the above that 2% < 2”'*! and, hence, aj < bj + 1,i-e., aj < bi. 
Changing the roles of a; and b; in the above computations, we analogously get b; < 
a; and, thus, a; = b;. Letting a; = b; = k, say, we obtain 


n— 2 = 2% 494 4...4 29-1 = 204991 4... 4 2b, (4.5) 


Since n — 2* < n, if we now invoke the uniqueness part of the induction hypothesis 
(i.e., if we assume that the binary representation of every natural number less than 
nis unique), then (4.5) gives j — 1 = 1— 1 and ap = bo, ay = Dy, ..., 4-1 = D-1, 
as we wished to prove. Oo 


Before you go on and try the proposed problems, there are two remarks that 
ought to be made. Firstly, if we want to prove the validity of a certain property P(n) 
of the natural number n, then using induction will not always be the best possible 
choice. For example, try Problem | and, after you are done, compare your proof by 
induction to the one given in Example 3.7. Secondly, apart from Problems 8 to 12, 
we do not present any applications of induction (and there are lots of them) to either 
Combinatorics or Number Theory. Instead, we refer the interested reader to [5]. 


Problems — Section 4.1 


1. Use the principle of induction to prove that the sum of the first 1 natural 
numbers equals ee 
2. Prove that, for every natural number n, one has 


n(n+ ny 


B42 4b = ( i 


3. Prove that, for every natural number n, one has 


i ve se td 1 ee 1 
2 3 mM-1 n n+l In—1" 


4. (Canada) Forn € N, let h(n) = 1+ 5 + z peer tH 4. Prove that 
n+h) + h(2) + h(3) +---+h(n— 1) = nh(n). 
5. Show that, for every integer n > 1, one has 
1 
1-24+2-34+---+(n-I)n= gt ae 1). 


6. Prove that, for every integer n > 1, one has 


1 
17437457 +---4+ Qn-1P = gon 1)2n(2n + 1). 
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11. 


12. 
13. 


14. 


15. 


16. 
17. 


18. 


19. 
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Given n € N and real numbers a), d2,..., dy, prove the general version of the 
triangle inequality: 


lay + az +++ + ay| < lay| + |a2| +--+ + Janl- (4.6) 


Also, show that if a1,a2,...,@n 4 0, then equality holds in (4.6) if and only if 
a1, 42,..., An have equal signs. 

Prove that 9 divides 4” + 15n— 1, for each natural n. 

Prove that 3 divides n? — n, for each natural number n. 


. Prove that 3” divides 43" ' — 1, for every integer n > 1. 


The result of the next problem is known as the fundamental principle of 
counting or as the multiplicative principle. 
* Show that, if we have n; ways of choosing an object of type 1, nz ways of 
choosing an object of type 2, ..., m, ways of choosing an object of type k, then 
the number of ways of choosing simultaneously one object of each of the types 
from 1 tok is nynz... ng. 
* Prove that a set with n elements has exactly 2” subsets. 
* Let (dy)n>1 and (by)»>1 be two sequences satisfying the second order linear 
recurrence relation with constant coefficients uz+2 + rug4) + sux = 0, for 
every k > 1. If aj = b, and ay = by, prove that a, = b,, for every n > 
1. Then, extend this result to the case in which both sequences (a,),>1 and 
(b;)n>1 Satisfy identical third order linear recurrence relations with constant 
coefficients. 
Let (a,)n>1 be a sequence of nonzero real numbers such that, for every n > 2, 
we have 

| n—1 


ajaj+1 aay 


j=l 


Prove that the sequence is an AP. 
(Bulgaria - adapted) * The sequence (a,,)n>1 is defined by a; = | and, fork > 1 
integer, aj41 = ae — ax + 1. Prove that, for every integer n > 1, we have: 


(a) Gn+1 = a1...ay + 1. 
(b) I 3 =2- =: 


Prove that, for every natural number n, we have 2?" > n", 

(Macedonia) Let x be a nonzero real number, such that x + x! € Z. Prove that 
x" +x" € Z, for every integer n. 

(Brazil) Let (%,)n>1 and (¥n)n>1 be sequences of real numbers such that, for 
every integer k > 1, we have xx4.1 = x2 —3xx € Yet = yp —3ye. If xt = y1 +2, 
show that x? = y, + 2 for every integer n > 1. 

(Putnam) Let (x,)n>0 be a sequence of nonzero real numbers satisfying, for 
every integer 7 > 1, the recurrence relation ies —Xn—1Xn+1 = 1. Prove that there 
exists a real number @ such that x4) = @x, — X,—-1, for every n € N. 


4.1 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


The Principle of Mathematical Induction 97 


The next four problems concern the Fibonacci sequence. We refer the reader 
to Example 3.17, for a review of its definition. 
Let (Fin)n>1 be the Fibonacci sequence. Prove that, for every n € N, the 
following identities are true: 


(a) Fy + Fo +++ + Fa = Foto —1. 
(b) FP + Fo +--+ F2 = FiF ny. 
(c) Fy + F3 + +++ + Fo = Fon. 
(d) Fo + Fates) + Foy = Foti — 1. 
(e) Fees — Fy,Fn+2 = (-1)’. 


Let (F;,)n>1 be the Fibonacci sequence. Prove that, for every m,n € N, with 
m> 1, we have Fintn = FinFn+i ae Fin—1F ny. 

Let (Fi,)n>1 be the Fibonacci sequence and (L,)n>1 be the Lucas sequence (cf. 
Problem 7, page 77). Given m,n € N, prove that: 


(a) 2Fim-tn = Fply + Frm. 
(b) 2L, tna = SPF n + LnLn- 


Let (F;,)n>1 be the Fibonacci sequence. 


(a) Prove that F,, = n? if and only if n = 1 or 121! 


(b) Ifa = naey do the following items: 


i. Prove that a” = F,,a + F,-1, for every integer n > 1. 
ii. Find all n € N such that a” — nq is an integer. 


(TT) Let (a,)n>1 be a sequence of pairwise distinct natural numbers. Prove that 
there are infinitely many naturals k such that a, > k. 

(France) Let (a,)n>+1 be a sequence of positive real numbers such that a; = 1 
and 


ay + ay t-++ +a) = (ay tag tess +a), 


for every integer n > 1. Show that a, = n, for every integer n > 1. 
* For a fixed real number a > 1, let (%,)n>1 be a sequence of real numbers such 


that fa <x, < /at+landx41 = 5 (% + “), for each integer k > 1. Prove 
that, for every integer n > 1, we have 


1 
Va <x < Vat 5. 


(France) Prove that, for every integer n > 5, there exist n positive integers such 
that the sum of the inverses of their squares is equal to 1. 


'More generally, it can be shown that F,, is a perfect square if and only if n = 1 or 12. For a proof 
of this fact, see the problems of Section 12.3 of [5]. 
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29. 


30. 
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Given m,n € N, with m > 1, prove that there exist unique nonnegative integers 
k, ao, Q1, ..., ax Such that 0 < a; < mfor0 <i<k, a, #0 and 


n= ay + ay—m'! Se apm +ajm-+ ao. 
In this case, we write n = (axay—1 ...1d0)m and say that the right hand side 
above is the representation of n in base m, or the m—adic representation of n. 
(Sweden) Prove that, for every natural number n, there exists a unique sequence 
(a;)j>1 of integers, such that 0 < a; <j for every j > 1 and 


n=a,-1!+a.-2!+a3-3!4+---. 


* Prove Zeckendorf’s theorem”: every natural number can be uniquely written 
as a sum of Fibonacci numbers of nonconsecutive and greater than | indices. 


4.2 Binomial Numbers 


We start by recalling the definition of fatorial, extended to the nonnegative integers. 


Definition 4.13 The fatorial of a given nonnegative integer n is the number n!, 
given by 


1, ifn =0 
n! = n 


ee: ifn>1° 


One could guess that it would be much more reasonable to define 0! as being 


equal to 0. Nevertheless, the reasons for letting 0! be equal to 1 will soon become 
evident. 


Definition 4.14 Given integers n and k, with 0 < k < n, we define the binomial 
number (7) by 


n\ _ n!\ 
k} Kn —k! 


It is easy to verify, directly from the definition, that 


Os (r= (2 


? After Edouard Zeckendorf, belgian mathematician of the XX century. 
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for every nonnegative integers 1 for which the binomial numbers above are defined. 
On the other hand, for every integers n and k such that 0 < k < n, we have 


3) 


Indeed, this follows at once from 


n\ _ n! _ n! _ n 
kK} kKn-k! (n—/Mn—-(n—h)! \n—k) 


Notice that the binomial numbers (5), ({) and (5), whenever defined, are all 


naturals (in the case of (5) = we this is due to the fact that the product of 
any two consecutive integers is an even number). This is also true in the case of 


(5) = mower for, among any three consecutive integers, there is always a 
multiple of 3 and an even integer, so that n(n — 1)(n — 3) is always a multiple 
of 6. On the other hand, (4.7) assures that (”) — (3). (,.",) = (7). (2) = (5) and 
(2) a (5). whenever defined, are also natural numbers. 

The discussion of the previous paragraph makes it natural to ask whether (7) is 
a natural number for every choice of integers n and k such that 0 < k < n. This is 
indeed the case, and will be a straightforward consequence of relation (4.8) below,* 
which is known as Stifel’s relation.* 


Proposition 4.15 Given integers n and k such that 1 < k <n, we have 


n\_ [n-1l fi n—1 (4.8) 

k} \ ok k-1} 
Proof It suffices to apply the definition of binomial number to the right hand side 
of the equality above, together with some algebraic manipulations: 


n—1 m-1\ _ = (-—1)! (n—1)! 

a aa | “f= G= iio! 
_ (n—1)! 1 1 
=o a G+) 


_ (n—1)! n 
~ (k-Din-—1-—b/!!kn—k) 


_ n! _ fn 
~ Kkin—-k! \ky 


3 Another proof is the object of Problem 8. 


4 After Michael Stifel, german mathematician of the XVI century. 
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With the binomial numbers we construct an infinite triangular numerical table, 
known as Pascal’s triangle,’ in the following way: we count lines and columns 
starting from 0, lines being labelled from top to bottom and columns from left to 
right; for 0 < k < n, the (n, k)—th entry, i.e., the number written on the crossing of 
the n—th line and the k—th column, is the binomial number (7). More precisely: 


¢ The entries of column 0, read from top to bottom, are respectively equal to the 
binomial numbers (i): (I; (i ). .... As we have already seen, all such numbers 
are equal to 1. 

e Line 0 is formed solely by the binomial number (5) = 1. Line | is composed by 

the binomial numbers (;) and (;). both of which are also equal to 1. 

¢ In general, the entries of line n, read from left to right, are respectively equal to 


the binomial numbers (6): (i). (5), we (”). 


o)? Mi) Xo 
According to the above recipe, we show below the first few lines of Pascal’s 
triangle: 


0 


Pascal’s Triangle 


With respect to Pascal’s triangle, Stifel’s relation says that, whenever we add, in 
line n — 1, the entries at columns k — 1 and k, we get the entry situated at line n and 
column k. This is more difficult to say than to understand and verify, and allows us 
to recursively get the numerical values of the binomial numbers (i). The table below 
shows the numerical values of the binomial numbers (7) for 0 < n < 6, obtained 
with the aid of Stifel’s relation. 


> After Blaise Pascal, french mathematician of the XVII century. Besides the triangle that bears his 
name, there is an important Pascal’s theorem in the theory of conics, which has greatly motivated 
the developments of Projective and Algebraic Geometry. 
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aow»r wn Fe 
a 
aw 
me 


More gerally, since (5) = | and (") = 1 whenever these numbers are defined, 
it is not difficult for the reader to convince himself or herself that (7) € N for all 
integers n and k such that 0 < k < n. We give a formal proof of this fact in the 


corollary below. 
Corollary 4.16 For all integers n and k such that 0 < k < n, we have (7) EN. 
Proof Let’s make induction on n > 0, the case n = 0 being obvious, for the only 
such binomial number is (°) SL, 

Now suppose, by induction hypothesis, that Ce) is a natural number for every 


0 <j < n-—1, and consider a binomial number of the form (7). There are two cases 
to consider: 


(i) If k = 0 ork = n, then we have already observed that () = (") = 1. 
Gi) If 1 < k < n—1, then it follows from the induction hypothesis that both ‘es 
and ae) are natural numbers. Hence, Stifel’s relation gives 


n n—-1 n—1 
= EN. 

Later (cf. Theorem 4.20), we shall see that one of the main utilities of Pascal’s 
triangle is that it provides an easy and quick way to write down the expansion of 
the binomial (x + y)” for small values of n. For the time being, let’s establish some 
useful identities relating the entries of a row, column or diagonal of it. 


The formula of the following proposition is known as the columns’ theorem of 
Pascal’s triangle. 


Proposition 4.17 In the column n of Pascal’s triangle, the sum of the entries in 
rowsn,n+1,...,2+k—1 is equal to the entry located at column n + 1 and row 


n+k. In symbols, 
k-1 
j k 
3 = @ ) (4.9) 
n n+1 
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Proof Let’s make a proof by induction on k > 1. The initial case k = | reduces to 
verifying that (”) = (a) which is immediate. 

By induction hypothesis, suppose that, when k = / > 1, (4.9) is true for all 
nonnegative integer values of n. Then, for k = /+ | and every integer n > 0, 


we have 
: n+j as n+j n+l 
j=0 j=0 
(ay es 
= + 
n+1 n 
_ n+1+1 
~ n+1 , 


where we used Stifel’s relation in the last equality above. 
Therefore, it follows by induction that (4.9) is true for every k € N and every 
integer n > 0. oO 


In the coming example, we employ the columns’ theorem to compute the sum of 
the first n perfect squares. 


Example 4.18 Given n € N, compute the value of 17 + 27 + ---+ n? in terms of n. 


Solution Letting S = }°_, 7°, we have 
n n n j n j 
S=)\jG-)+ j= 2» (') +> (‘) 
j=l j=l j=2 j=l 


Therefore, it follows from (4.9) that 
1 1 1 
S= ("3 es ("; = cn(n + 1)Qn +1). 


Given an integer n > 0, the diagonal n of Pascal’s triangle is formed by the 


binomial numbers 
n n+1 n+2 n+3 
oO}? 1 , 2 ; 3 aa 


With respect to them, the following corollary is the diagonals’ theorem of Pascal’s 
triangle. 


oO 
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Corollary 4.19 In the diagonal n of Pascal’s triangle, the sum of the entries in rows 


0,1,...,k—1 is equal to the entry located at row n+k and column k—1. In symbols, 
k-1 : 
k 
eg - ) (4.10) 
é J k—1 
j=0 


Proof Since () = Ve), it follows from the columns’ theorem that 
ale 2 sy) 2 Ge 7 ay 
= J 6 n n+1 k-1 


Problems — Section 4.2 


. Prove that (°") is even, for everyn € N. 

. Use the results of this section to compute the value of 17 + 23 +--+. + n3, for 
everyn EN. 

3. * Forn €N, prove that 


Ne 


if 1 is even, and 


© OB e>€ 


4. Prove that, for every integer n > 2, one has rhe (hk 
5. Given natural numbers k and m, with m > k, prove that 


ee et) he 
Lo Gyre 1 RH 


k 


6. Let n and k be given integers, such that 0 < k < n. Prove that 


(+ G--rer er) 
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7. Let (Fx)g>1 be the Fibonacci sequence (cf. Example 3.17). Show that, for every 
n € N, one has 


n—-j-1 
Fy | 
j=0 
where [) denotes the greatest integer less than or equal to a ie., 


n—1|_ § =, ifn is odd 
2 ~ —s if n is even © 


8. * Given integers n and k such that 0 < k <n, prove that the set {1,2,...,m} has 
exactly (7) subsets of k elements each. 


4.3. The Binomial Formula 


We shall now obtain Newton’s binomial formula,° i.e., we shall explicitly write 
(x + y)" as a sum of monomials of the form xy’. 


Theorem 4.20 Givenn € N, we have 


n 


@w+y"= >> ; xray), (4.11) 


j=0 
Proof Forn = 1, we have 


1\ 


1 
Qty =x+y=[) Jai’ + [_ Jy’. 


0 1 


Suppose, by induction hypothesis, that (4.11) is true for n = k, i.e., that 


4 eee 
@ty=)>> jp 


j=0 


®Sir Isaac Newton, english mathematician and physicist of the XVII century, is considered to be 
one of the greatest scientists ever. Actually, it is difficult to properly address Newton’s contribution 
to the development of science. Known as the father of modern Physics, Newton also created, 
together with G. W. Leibniz, the Differential and Integral Calculus. His masterpiece, Philosophiae 
Naturalis Principia Mathematica, is one of the most influential books ever written and contains 
the cornerstones of both Calculus and Physics. 
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Forn = k + 1, we thus have 


k 
(ty)! =@+yaty=e4+9)) (joy 


j=0 


k Bhi 4 P k goiyitl 
J J 


j=0 


> 


k 
0 


J 
k 


E 3 eee - 
— etl > tli > iriyitl yet, 


j=l 


In the last line above, let’s perform the following changes of indices: in the first 
>-, change j per / and, in the second )*, change j+ 1 per /; in this way, in the second 
> we have j =/—1and0<j<k-—1<¢1<1/<>k. Then, we get 


k 
(x+ a = kt ae :D (() al ie | ge oh ie ai 


1=1 
k 


att 4 ‘ eee 4 ye, 


1=1 


where we used Stifel’s relation in the last equality. 


Finally, since ) = () = 1, we can write the last line above as 


k 
k+ ') k+1 & ' ktI-Id (‘7 7 k+1 
x + x y+ y 
( 0 d k+1 


or, which is the same, 
k+1 
k+1 _ 
Pa ( ; ae ty. 
l=1 


This is exactly the expression we would like to reach, so that, by induction, we 
conclude that (4.11) is true for every n € N. oO 


Corollary 4.21 Givenn € N, we have 


(x—y)" = peli}. 
j=0 y 


Proof It suffices to apply (4.11), writing —y in place of y. Oo 
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rl) 


and say that such a monomial 7; is the general term of the expansion of (x + y)”. 
Hence, 


It is customary to write 


(x4 y)" = To +T, F---(41)"Th. 


In what follows, we collect some examples of application of the binomial 
formula, as well as some important consequences of it. 


Example 4.22 Find the least n € N for which the expansion of (xa/x + +)" has a 
summand not depending on x. For such an n, compute this summand. 


Solution In expanding the given expression, the general term is 


k 
Hf ped — 17) 3am -4e fe) eck 
(‘evs (=) = (‘) x sal (rt x 2, 


Therefore, there is a summand not depending on x if and only if there exists 0 < 
1k 


k < nsuch that 3n — 11k = 0. Hence, n = =z, So that the least possible n € N is 

n = 11, obtained for k = 3. In this case, the summand that doesn’t depend on x is 
ul 

() = 165. o 
3) 


Example 4.23 Let k € N and a,b,r € Q, with r > 0 such that ,/F is irrational. 
Prove that: 


(a) There exist c,d € Q such that (a + b./r)t = c+ dvr. 
(b) If (a+ br) = c + dv, with c,d € Q, then (a—b/*)k = c— dF. 


Proof 
(a) By expanding (a + b./r)*, we get 


(a+b yr = > () ai Vr + Jr Y* () aT, 


O<j<k O<j<k 
2 2ti 
Making 
k A k EF 
oe (j)eova and d= )> (i) over. (4.12) 
O0<j<k O<j<k 
al 2i 


it is immediate that c,d € Q (since V7 € Q for an even j, and V7! € Q for 
an odd j). 
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(b) It suffices to note that the expansion of (a — b./r)* is essentially equal to that 
of (a + b,/r)‘, the difference lying on the signs of half of the summands. More 
precisely, in the notations of (4.12), we get 


(aby = () aibidi — Jee (i) wv 


0<j<k O<j<k 
2 2ti 


=c-—dy’‘r. 


Example 4.24 Use Newton’s binomial formula to prove Lagrange’s identity:’ 


n 2 
n 2n 
= . 4.13 
¥()-(") — 


Proof Writing (1+x)*" = (1+x)"(1+x)” and applying Newton’s binomial formula 
to both sides, we get 


EC) -EOMEO)-L00 


Comparing the coefficients of x” in the first and last expressions above and 
using (4.7), we get 


(2)-E()()-E 0) 20) 


Item (a) of the following corollary is known as the lines’ theorem of the Pascal’s 
triangle. 


oO 


Corollary 4.25 Givenn € N, we have: 


(a) yy (‘) = 2" 
b o<jen (") = Yogjen (4) = Qt, 
(0) Seger () = Cogn) 


Proof For item (a), it suffices to set x = y = | in the formula for (x + y)”. 


Joseph Louis Lagrange, french physicist and mathematician of the XVIII century. Lagrange was 
one of the greatest scientists of his time, with notable contributions to Physics and Mathematics. 
In particular, he was a pioneer in the fields of Calculus of Variations and Celestial Mechanics. 
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In what concerns item (b), we start by setting x = 1 and y = —1 in the binomial 


formula to get (check it!) 
O0<j<n J 0<j<n J 
al 2ti 


Now, letting A = )“o<j<n (‘) and B= ¥ psp, ‘OF it follows from (a) and the 
27 2ti 
above relation that 
A+ B= 2" 
A-B=0 
Hence, we have A = B = 2”"!, oO 


Remark 4.26 It is possible to generalize the formulas of item (b) of the previous 
corollary in the following way: given integers 0 < r < k < n, we can compute the 


value of the sum 
7 + ‘ + ‘ + 
r k+r 2k+r 


in terms of n, k and r. For instance, for k = 3 we can compute, in terms of n € N, the 
values of the sums (6) + (3) + (6) +--+. (7) +) +G) +: and (5) + (5) + (@) +o 
The corresponding deduction uses the multisection formula, which will be obtained 
in Chapter 15 of [5], with the aid of complex numbers. 


We finish this small chapter with an example that illustrates the use of the lines’ 
theorem of Pascal’s triangle to the computation of sums. 


Example 4.27 Given n € N, compute the value of 
n re n 3 n toe n 
eee n is 
1 2 3 n 
Solution First of all, note that for j ¢ N we have 


n} n! 


fry ee ces 
i( ~/j@—=)l G-Dia—j! 


(n— 1)! n—1 
=> an = nN . 
G—-pia—pi"\j-1 
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Hence, it follows from the lines’ theorem that 


>¥(') 3 a = nr 


j=l 


Oo 
Problems — Section 4.3 
1. Given n ¢ N, compute the value of the sum }°7_» Le: 
2. Let A = Stig (")3* and B = 42h (",)11* If 2 = %, compute the 
value of n. 
3. Show that (1, 1)” > 345(n? + 19n + 200). 


11. 
12. 


13. 


. (OCM) Without directly evaluating all summands, compute the value of the 


sum 


1 1 1 1 1 1 
10! 318! SI6! 7!4t9I2t 1A 


. Find the maximal term in the expansion of (1 + ae 
. (Baltic Way). Let a, b, c and d be real numbers for which a? + b* + (a+ b)? = 


c? + d* + (c +d)’. Prove that at + b+ + (a+ b)4 = c+ + d*+(c+)'. 


. Given n € N, prove that (°”) > | 
. Inthe expansion of x(1+-.x)”, divide the coefficient of each term by the exponent 


7 a oe 
of x in that term. Prove that the sum of all these numbers equals uae 


. Given a € Z \ {0} andn € N, compute the value of the sums }7"_5 (‘) jai and 


Di=0 (i)i7 in terms of a and n. 


. The sequence (a;)x>1 is an AP. Prove that, for every integern > 1, the following 


identities are valid: 


(@) Dho (CY #4; = 0. 
(b) imo (j)(-D* "a7, = 0. 


*If0 <q < landn€N, prove that gq" < 


rar : 

Given a, b,n € N, with n > 1, prove that the number ./a + a/b is the root of a 
polynomial equation with integer coefficients and degree 2n. 

(Croatia) Let x, y and z be nonzero real numbers, such that x + y+ z = 0. Prove 
that the value of 


P+yptr 
xyz(xy + yz + zx) 


does not depend on the particular values of x, y and z. 
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14. 


15. 


16. 
17. 


18. 


19. 
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* Given n € N andj, k, / nonnegative integers such that j + k+/ = n, we define 
the trinomial number (eg by 


n = 
jk) plu 


Prove the trinomial expansion formula 


n : 
@+yt+ y= DO ( “ ire (4.14) 


j+k+l=n J 


where the sum in the right hand side extends over all possible choices of 
nonnegative integers j,k,/ such that7 + k+l =n. 

Use the result of the previous problem to expand the trinomials (x + y+ z)* and 
(xt+y+z)*. 

Compute, in the expansion of (1 +x+ 9) oF the term that doesn’t depend on x. 
Prove the following identities involving trinomial numbers: 


(a) ae ktl=n (aa) =3". 
(b) Doise =e == 


* Prove that, for every n € N, we have 


n 
n 
Po (5) Fn = Fyii—1, 


j=l 


where F;, is the k—th Fibonacci number (cf. Example 3.17). 
(IMO shortlist) Prove that 


y (-1)* (1991-k\ 1 
1991—k k ~ 1991" 


k=0 


Chapter 5 
Elementary Inequalities 


This chapter is an invitation to the systematic study of algebraic inequalities. More 
precisely, our main purpose here is to discuss some interesting examples of inequal- 
ities, for whose derivation we can use the simple mathematics we developed so far. 
Later, when we have the tools of Calculus at our disposal, we shall return to the study 
of algebraic inequalities, largely generalizing some of those we will study here. 


5.1 The AM-GM Inequality 


As will be clear in this section, what allows us to develop a systematic study of 
inequalities is the basic fact that the square of any real number is nonnegative, and 
it is zero if and only if the number in question is also equal to zero. 

To start with, for x, y € R we know that (|x| — |y|)? > 0, with equality if and only 
if |x| = |y|. If we expand the left hand side, we get the inequality |x|? + |y|? > 2|xy| 
or, which is the same, 


2 2 


2 


>= |xyl, (5.1) 


with equality if and only if |x| = |y|. 
If we now start with two positive real numbers a and b, and make x = fa > 0 
and y = /b > 0, it follows from (5.1) that 


b 
— Sab. (5.2) 


with equality if and only if /a = J, i.e., if and only if a = b. 
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The simplicity of the previous reasoning hides its importance. Actually, (5.2) 
is a particular case of a much more general inequality, known as the arithmetic- 
geometric mean inequality (see Theorem 5.7). For the time being, let’s see how to 
deduce other interesting inequalities from (5.2). 


Example 5.1 Given positive real numbers x and y, we have: 


(a) x++ ; = 2, with equality if and only if x = 1. 
(b) + -S > — with equality if and only if x = y. 
Proof 


(a) Applying (5.2) witha = x and b = t, we get 


with equality if and only if x = 1, ie., if and only if x = | (for, x > 0 by 
hypothesis). 

(b) Sincex+y> 0, we have + + t > t; if and only if (+ y) (4 +i)> > 4 or, 
which is the same, if and only i ie oa * > 2. In turn, this last inequality follows 
directly from that of item (a), with 4 = in id place of x. It also follows from (a) 


that equality holds if and only if = = 2, ie., if and only if x? = y*. However, 
since x, y > 0, such condition is cheats equivalent tox =y. 


For another proof of item (b), apply (5.2) twice and multiply the results: 


1 1 1 1 
wy (2+ 2) = 2va-2/2-— <4. 
x y x y 


It follows directly from (5.2) that we have equality if and only if x = y. Oo 


Example 5.2 Given positive real numbers x, y and z, prove that 
DA 2. 2 
ty te zxytxzt yz, (3.3) 


with equality if and only if x = y = z. 


Proof Applying (5.1) three times, we get the inequalities 


xr+y? w+ 77 y+2 


It now suffices to add these three inequalities to reach (5.3). 


If x = y = z, then (5.3) clearly becomes an equality. Conversely, if x # y (the 
x2 +y? 
2 


cases x # zand y # zcan be treated analogously), then > xy. Therefore, 
an obvious generalization of item (e) of Proposition 1.2 assures that, adding the 
inequalities 
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e+y poe ye 
xy, 5) 2 XZ, 


= yz, 


we get x7 + y* + 2? > xy + xz + yz. Hence, in order to have equality in (5.3), we 
must have x = y = z. oO 


The next two examples extend (5.2) to three and four positive reals. We start with 
the case of four positive numbers. 


Example 5.3 Given positive real numbers a, b, c and d, we have 


b a 
ee A abed. (5.4) 


with equality if and only ifa =b=c=d. 
Proof We already know that ae > /ab and td > /cd. Therefore, 


b a. 22.88 Jona Jed 
re = ——s > oe & al ab on = aed. 


where the last inequality above follows from yet another application of (5.2). 
Equality happens if and only if we have equality in all of the inequalities above. 
For the first one of them, we must have ath = Jab and ctd = Jcd, and, 
hence, a = b and c = d. For the second one, we must have /ab = Vcd: however, 
since a = b andc = d, this is the same as Va = V2, i.e., a = c. Therefore, we 
have equality if and only ifa = b=c =d. oO 


The previous example raises the natural question of whether a similar inequality 
is true for three positive real numbers. As we have already anticipated, this is indeed 
the case, albeit its deduction will not be as immediate as the one just discussed 
(nevertheless, see Problem 11). 


Example 5.4 Given positive real numbers a, b and c, we have 


+b+ 
oe ae (5.5) 
3 
with equality if and only ifa =b=c. 
Proof We apply the inequality of Example 5.4 to the four positive reals a, b, c and 


d = Vabc to get 


b ab — aver — 
Se hes abe-= VBd=d= Sabc. 


Then, a+ b+c+ VWabc > 4/abc or, which is the same, arbre > Yabce. 
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Fig. 5.1 Rectangular 
parallelepiped with 
dimensions x, y and z 


From the above computations it is clear that equality holds if and only if it also 
happens in the inequality 


3Y/ 
ss = Fy abc*/ abc. 


Therefore, it follows from Example 5.3 that equality holds if and only ifa = b = 
c= VJabc, ie., if and only ifa=b=c. oO 


At this point, the reader is probably formulating in his/her mind a natural 
generalization of inequalities (5.2), (5.4) and (5.5), which we anticipated to be true. 
Before we formally state and prove it, let’s see an interesting application of (5.5). 


Example 5.5 A box (i.e., a rectangular parallelepiped!) is such that the sum of its 
edge lengths equals 48 cm. If the box encloses the largest possible volume, find its 
dimensions. 


Solution Figure 5.1 shows a rectangular parallelepiped with dimensions x, y and 
z (measured in centimeters), such that the sum of the lengths of its twelve edges 
equals 48 cm. Since each dimension comprises four equal and parallel edges, this is 
the same as saying thatx + y+ z= 12. 

It is a well known fact that the volume V of such a parallelepiped is given by 
the formula V = xyz. Hence, algebraically our problem reduces to the one of 
maximizing the product xyz, under the restrictions that x, y,z > Oandx+y+z = 12. 
To this end, we apply inequality (5.5) to get 


3 3 
12 
Vance (“2*) -(F) = 64, 


thus concluding that V is at most 64cm?. Example 5.4 also shows that the volume 
equals 64cm? (i.e., that equality holds in the above inequality) if and only if x = 
y = zor, which is the same, if and only if x = y = z = 4cm. Therefore, the box of 
largest possible volume is a cube with edge length equal to 4cm. Oo 


'For the elementary facts on the notion of volume of solids, we refer the reader to [4]. 
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Before we state the promised generalization of (5.2), (5.4) and (5.5), we need 
some terminology. 


Definition 5.6 Given n > 1 positive real numbers a), a2,... , dn, We define their: 


(a) Arithmetic mean as the number 4(a, + a2 + +++ + dn). 
(b) Geometric mean? as the number 2/a)d7... dp. 


In the context of the previous definition, what we did in (5.2) and in Examples 5.3 
and 5.4 was to show that the arithmetic means of two, three or four positive reals 
are always greater than or equal to the respective geometric means, with equality in 
each case if and only if the given numbers are all equal. We establish the general 
case in the coming result, whose proof can be omitted in a first reading. As the title 
of this section suggests, inequality (5.6) below is known as the arithmetic mean- 
geometric mean inequality. 


Theorem 5.7 Given n > | positive reals a, a2,...,Qy, their arithmetic mean is 
always greater than or equal to their geometric mean. In symbols: 


a, +a, 4++**+ ay 
n 


> X/a\d2..., An, (5.6) 


with equality if and only if ay = az = +++ = dy. 


Proof Firstly, let’s prove by induction that the desired inequality is true whenever n 
is a power of 2, with equality if and only if a; = az = --: = a,. To this end, we have 
to verify the initial case n = 2 (which was already done along the discussion that 
established (5.2)), formulate the induction hypothesis (for n = 2/, say) and execute 
the induction step (i.e., deduce the case n = 2/+! from the case n = 2/). However, 
since 2/t! = 2 - 2/, it suffices to suppose the inequality to be true for any k positive 
reals (with equality if and only if these k numbers are all equal) and, from this, to 
deduce that it is also true for any 2k positive reals (with equality if and only if these 
2k positive reals are also all equal). Hence, to perform the induction step, consider 


2k positive reals a}, d2,...,€2,%. The induction hypothesis, together with the case 
n = 2, give 
1 2 fic k 
rom i= 5 pout ed atti 
j=l j= i 


1 
= 5 (i/a; 2 At &/ag41 ee. ar) 


> 4) ay... age/agyy ... Arq 
= Way... Apap... Ark. 
?The reader maybe find it useful if we observe that the adjective geometric attached to this number 


comes from the case n = 2. In this case, inequality (5.2) has a simple geometric interpretation, for 
which we refer to the problems of Sect. 4.2 of [4]. 
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To have equality, we must have it in all passages above. Then, it must be that 


att + ak Re 


——" k —* Kh 
= May... Ag, ———@@@_ = agg... x 
k k 


and 


k/q,.. dL Efi OE 
a,...dae+ Ak+ 1. ++ AK 7 ; 
rr = ay a Ak Ag+ | «++ Ak. 


For the first two equalities, it follows from the induction hypothesis that aj = 


+ = ay and ag+; = --- = yx. For the third one, we must have ¢/aq,...a, = 
‘/ag41... a2; this last condition, together with the two former ones, implies that 
we must have aj = --- = ay = ag+1 = +++ = ax. It is also evident (verify, 


anyhow!) that, if the 2k positive given numbers are equal, then equality must happen. 
Therefore, (5.6) is true by induction, with the stated condition for equality, whenever 
nis a power of 2. 

Let’s now prove, by strong induction, that the inequality is true in general, with 
equality holding if and only if all of the numbers are equal. To this end, let n > 1 be 
natural and aj, a2,..., dy be given positive reals. Take k € N such that 2k > n, and 
apply the arithmetic mean-geometric mean inequality to the numbers aj, a2,..., dn, 
together with 2 —n copies of the number a = 4/a;az...dn (thus, to a total of 
n+ (2* —n) = 2* numbers, in which case we already know that the inequality is 
true). We get 


ay Par ere. a 
2 - 


ok ok 
io ik. ke 
= a" a2 ae a =a. 


Therefore, it follows that a, + az +--+ + a, + (2* —n)a > 2*a and, hence, 


ik 
a, ...d,-a2—" 


a, + az +++ + an 
n 


> a= X*/a\a2... an. 


For the equality to hold, we already know that we must have a; = ay = --- = 


a, = a = +++ = a; in particular, all of the numbers aj, a2,...,a, must be equal. 
Finally, it’s immediate to see that, if all of the numbers a, az, ..., a, are equal, then 
equality holds. Oo 


The following corollary generalizes item (b) of Example 5.1. For an alternative 
proof, see Problem 21. 


Corollary 5.8 Given n > | positive reals ay, az,...,An, we have 


i, 1 1 5 
(pape) | ee ee (5.7) 


a a2 an 
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with equality if and only if ay = az = +++ = dy. 
Proof Applying (5.6) twice, we get 


i 4 1 
(as + ay ++ a) tote )> 


a\ a2 an 


In order to have equality in (5.7), we must have equality in (5.6), and we know that 
this implies a; = az = --- = ay. Conversely, it is immediate to verify that, if all of 
the n given numbers are equal, then we have equality in (5.7). oO 


Remark 5.9 The inequality (5.7) is sometimes referred to as the arithmetic mean- 
harmonic mean inequality. This is due to the fact that it can be written as 


ay tan tetany | (Wetve rt ules)" 


n n 


and that the right hand side above (i.e., the inverse of the arithmetic mean of the 
inverses of a), a2, ..., Gn) is known as the harmonic mean of a), a2, ..., Gn. 


We now illustrate the use of the inequality on the arithmetic and geometric means 
in three examples. 


Example 5.10 (Israel-Hungary) Let k and n be positive integers, with n > 1. Prove 


that 
: + : feet ! > nf kt I 1 
—a, eee — a n —_—_ _—— i 
kn = kn+1 kn+n-—1 k 


Proof It suffices to see that 


n—1 n—1 n—-1 : 
1 1 kn+j+1 
— 1 = —————— 
mat Ga ) 2 kn +j 


j=0 j=0 


- Te a 
knt+j k 


j=0 


where we’ve applied the inequality on the arithmetic and geometric means once. 


Note that, since the numbers oe are pairwise distinct, equality never holds. O 
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Example 5.11 (APMO) If a, b and c are positive reals, prove that 
b b 
(1+) lee (1+=)>2 ieee), 
b c a Jabc 


Proof Expanding the left hand side, we obtain 


a b Cc atc b+c a+b 
(1+=) ities (1+=)=24 n + , 
b Cc a b a Cc 


and it suffices to prove that 


atc b+c a+b_ 2%a+b+c) 
+ --— 2 
b a c abc 


Letting S denote the left hand side of the last expression above, it follows 
from (5.6) and (5.7) that 


3 


1 1 1 
S=(4+b+0(-+542)-3 
a boc 
Bee ey eee Cee ee ee eee 
=a P a bee re . a bee 
2. 3 1 
= Fat o+a( J+ 5-9-3 


abc 
2(a+b+c) 
abc ; 


oO 


Example 5.12 Gabriel has a sheet of cardboard of 2 m by 3 m. In order to assemble 
an open box, he cuts four equal squares from the corners of the sheet, folds it along 
the cuts and glues the lateral faces of the box along their common edges. If the box 
is to have the largest possible volume, what should be the length of the sides of the 
squares he cut? Justify your answer. 


Solution If x is the common length of the sides of the cut squares, then one must 
clearly have 0 < x < 1. Since the box has height x and its bottom is a rectangle of 
side lengths 2 — 2x and 3 — 2x, the volume Gabriel wants to maximize depends on 
x and equals (2 — 2x)(3 — 2x)x. 

One possibility for him is to try to apply the inequality between the arithmetic 
and geometric means to get rid of x and, then, see what the condition for equality 
says about the size of x. However, he cannot do this directly, for, although 

(2 — 2x)(3 — 2x)x < (ate aay, 


the expression at the right hand side still depends on x. 


5.1 The AM-GM Inequality 119 


Nevertheless, the following trick does the job: he starts by choosing positive reals 
a, b and c such that a(2—2x) +b(3—2x) +cx doesn’t depend on x and such that there 
exists at least one value of x € (0, 1) for which a(2 — 2x) = b(3 — 2x) = cx. This 
amounts to finding a positive solution (a, b, c) for the linear equation 2a+2b—c = 0, 
such that this solution, in turn, gives equal solutions for the first degree equations 
2(a—b)x = 2a—3b and (2a+c)x = 2a, which should belong to the interval (0, 1). 
Hence, we should have 


2a + 2b fg ay, 
=c and ——— 
. CT asp) a aSe 


If we succeed in me | a,b,c > O satisfying the given equations, we will 


automatically have 54 ¢€ (0, 1). To what is left to do, substitute c = 2a + 2b into 
the second equation e ga <— =i = 75 sath OF, which is the same, 2a*—2ab—3b? = 0. 


Therefore, £ is a positive solution of the second degree equation 2u” — 2u — 3 = 0, 
so that . = eg It thus suffices to choose b = 2,a = 1+ J/7andc = 2a+2b = 
6 + 27. 

With these choices of a,b,c at hand, Gabriel can successfully implement the 
heuristic reasoning of the second paragraph of the proof. Writing V for the volume 
and recalling that 2a + 2b = c, we have 

abc V = a(2 — 2x)b(3 — 2x)cx 


7 ec 2x) + b(3 — 2x) + ay 


3 
= (= + =, 
= q : 
Equality holds if and only if a(2—2x) = b(3—2x) = cx. However, we already know 
that these two equations have x = = as common solution, so that the maximal 
possible volume is attained only for x = xp = ora == =v , and equals 
1 /2a+3by3 10+ 7V7 
— = xo(2 — 2x9)(3 — 2x9) = ——-—_. 
— (=F) = x02 — 2¥0)(3 — 2x0) = —S 


Oo 


Later, when we have the methods of Calculus at our disposal, the previous 
example will fall into the general framework of the analysis of the first and second 
variations of the function V(x) = (2 — 2x)(3 — 2x)x; as such, it will have a 
straightforward solution. Nevertheless, the solution we presented above, of choosing 
adequate weights prior to applying the inequality between the arithmetic and 
geometric means, albeit quite tricky, is an instructive one for several other situations. 
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Problems: Section 5.1 


1. * Generalize item (a) of Example 5.1. More precisely, prove that, if x is a 
nonzero real number, then |x + +| > 2, with equality if and only if |x| = 1. 


2. Given positive reals a and b, prove that . + ul > a’ +b’. When does equality 
hold? 
3. Given real numbers a < b < c, prove that the equation 


1 1 1 


x-a x-b x-c 


=0 


has exactly two distinct real roots. 
4. (Brazil) Let a, b and c be positive real numbers. Prove that 


(a+ b)\(a+c)>2Vabc(a+b+c). 


5. (USA) Prove that, for every positive real numbers a, b and c, one has 


1 1 1 1 
Ta. Fuca Lf es a 
wt+b+abe b+et+abe c+a+abce™ abc 


For the next two problems, the reader shall need some Euclidean Geometry, 
which we review next.* More precisely (cf. Fig. 5.2), if a = BC, b = AC and 
c = ABare the lengths of the sides of a triangle ABC, then there exist x, y,z > 0 
such thata = y+z,b=x+zandc = x+y. In fact, it suffices to take x, y 
and z as being equal to the lengths of the line segments determined on the sides 
of ABC by the points of tangency of its incircle. In the context of inequalities 
involving the lengths of the sides of a triangle, the substitution of them by y+ z, 
x + zandx + y is frequently referred to as Ravi’s transformation. 

6. (IMO) If a, b and c are the lengths of the sides of a triangle, prove that 


abc = (a+b—c)(b+c—a)(c+a-—Db). 


Fig. 5.2 Ravi’s 
transformation 


3For a thorough discussion of the facts that follow, see Sect. 3.4 of [4], for instance. 


5. 


10. 


11. 
12. 


13. 


14. 


15. 


16. 


17. 
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. Let a, b and c be the lengths of the sides of a triangle. Prove that 


a b Cc 


——_ + — — + — —_ = 
b+c-a cta-b a+b-c 


. (Baltic Way) Let a, b, c and d be given positive reals. Prove that 


at+c b+d_ cta d+b 
a+b b+e c+td d+a™7 


4. 


. If0 <x 4 1 andnis a positive integer, prove that 


_ x2nt 


; > (2n+ 1)x". 


(England) Prove that 3a* + b* > 4a°d, for all nonzero real numbers a and b, 
with equality if and only if |a| = |b| and ab > 0. 

* Prove directly (i.e., without appealing to (5.5)) that a? + b* + c? > 3abce. 
(Soviet Union) Let a, b and c be positive reals. Prove that 


(ab + ac + be)? > 3abc(a+b+c). 
(Soviet Union) If x, y, z > 0, prove that 


e yy 2 yp zg x 
St+54+52-4+-4+-. 
eo ee 


Let a and b be given positive reals. Prove that 
9a +b>+)>(atb4+c)’. 
Given positive reals a, b and c, prove that 
at(1+b*) + b4(1+c*)4+ (1 +a) = 6a°b*e’, 


with equality if and only ifa=b=c=1. 


Let a), dz, ..., dn be given positive reals. Prove that 
ay a2 a3 Qn-1 a 
— $f Ht pt Sn. 
a2 a3 a4 an al 
Let n > 1 be an odd integer and a), az, ..., a, be negative reals. Show that 


a; +a. +---+ ay, 
n 


< Y/a|a2... an, 


with equality if and only if all of the a;’s are equal. 
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18. 


19. 


20. 


21. 


22: 


23: 


24. 
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(BMO) Prove that, for every natural n, one has: 


(a) (n+ 1)" > 2"nl. 
(b) (n+ 1)"(2n + 1)" = 6"(n!)?. 


(Slovenia) Let x be a positive real number and m be a natural number. Prove that 


x(x + I(e+ 2)... m—1) > mixltatate ta, 


(Poland) If x1, x2, ..., X, are positive reals whose sum equals S, prove that 


S a: S oes S . n- 
S— x S— x2 S-x, n-—-l 


’ 


with equality if and only if all of the x;’s are equal. 
The purpose of this problem is to present an alternative proof of inequality (5.7), 
one that doesn’t make use of (5.6). To this end, do the following items: 


(a) Show that (7, ai)(, 4) = 2+ Dig (4 + 4): 
(b) Apply item (a) of Example 5.1 to each sum “ + “, with i < j, to get (5.7). 
qj Gi 


Prove the weighted arithmetic-geometric mean inequality: let a), az, ..., dn 
be positive reals and kj, kz, ..., k, be positive integers whose sum of inverses 
equals 1. Prove that 


ky kp kn 
Lp > aay... 
2 ee ee ea eee ae 
ki ko kn : 
with equality if and only if aj = ay = -:: = a,. (Note that the case ky = 
kKg=e =k, = 1 corresponds to the usual inequality between the arithmetic 


and geometric means.) 
(Romania) Let n > 1 be an integer and 0 < a; < dy <-:: < a, be given real 
numbers. Prove that 


a\ a2 an a\ az — a a3 — a2 Qn — An-1 


Under what conditions does equality occur? 
(China) Given positive reals a, b and c, prove that 


a+Vab+ Jabe _ , (S)(*) 
——————_—_— a ———— _]. 
3 - 2 3 
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5.2 Cauchy’s Inequality 


As a further application of the ideas of the previous section, let n > 1 be an 
integer and a1, a2, a3, bi, bo, b3 be real numbers such that aj + a3 + aj = 1 
and b} + b} + b} = 1. Since x? + y? > 2|xy| for all x,y € R, with equality if and 
only if |x| = |y|, we have 


a, + bi = |aibi|, 
a3 + b5 > \arbo], (5.8) 
az + b3 > |a3bs|, 


with equality if and only if |a,| = |b], |a2| = |b2| and |a3| = |b3|. Adding the left 
and right hand sides of the above inequalities, we get 


(aj + a5 + a3) + Of +5403) = Gi +h) + G+ 5) + G+ 43) 
> 2(\aib1| + |arb2| + |a3b3|) 
> 2layby + arbz + azb3], 


where in the last step we applied the triangle inequality for three real num- 
bers, (2.11). 
Hence, it follows from a? + a3 + a} = 1 and b} + b3 + b3 = 1 that 


|a,b, + dobo + a3b3| <1. 


Equality holds if and only if it holds in the three inequalities (5.8), as well as in 
the triangle inequality. Therefore, equality holds if and only if |a;| = |bi|, |a2| = 
|b2|, |a3| = |b3| and either a,b), azb2, a3b3 > 0 or ayb,, azb2, a3b3 < 0. However, 
it is immediate to check that such conditions are equivalent to a} = bi, dy = bz and 
a= b3. 

Now, consider arbitrary real numbers aj, a2, a3 and b,, bz, b3, such that at least 
one of a1, az, a3 and at least one of bj, bz, b3 are nonzero. For a positive real number 
c, let x; = & for 1 < i < 3. Since 


Dy? i. 42 
ay +a, + 43 


2,242 
XY + Xy 1X3 = 2 


’ 


we have 


mtotyeleocaVa+agt+a. 


Analogously, letting y; = a for 1 < i < 3, withd = a BF + bs + b2, we have 
yityst+ y3 = |. Therefore, it follows from our previous discussion that 
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Ix1y1 + x2y2 + x3y3| < 1, 
with equality if and only if x; = y; for 1 <i < 3. 
Substituting the definitions of x; and y; in the above inequality and recalling that 


c,d > 0, we conclude that the last inequality above is equivalent to 


|a,b, + azb2 + a3b3| Z 


<1, 
cd 
with ce = ,/aj+ a +a, andd = ,/b} +b} +b}. Moreover, equality holds if 
and only if a; = $b; for 1 < i < 3. Finally, note that the last inequality above is 


equivalent to 


cd = ai +45 +a5/b, + b5 + 0}. 


Up to this point, what we have done was to establish, form = 3, the famous 
Cauchy’s inequality.* We now turn to the general case. 


|a,b, + anb2 + a3b3| 


lA 


Theorem 5.13 (Cauchy) Letn > 1 be an integer and a, dp, ..., Ayn, bj, bo, ..., Dn 
given real numbers. Then, 


(5.9) 


with equality if and only if the a;’s and b;’s are respectively proportional, i.e., if and 
only if there exists a nonzero real number i such that 


aq, = Ab, a2 = Abo, weeny An = Abn. 


Proof {f all of the a;’s or all of the b;’s are equal to zero, there is nothing to do. 
Otherwise, in order to establish (5.9), it suffices to follow the steps of the particular 
case n = 3 discussed above. The only difference is that, whenever convenient, we 
have to use (4.6) instead of (2.11) (see Problem 2, page 127). oO 


Later (see Example 6.18), we shall give another proof of Cauchy’s inequality 
as an application of the theory of maxima and minima of quadratic functions. For 
a geometric interpretation of Cauchy’s inequality for n = 2, see the problems of 
Sect. 6.3 of [4]. 

The coming two examples illustrate how one can apply Cauchy’s inequality. 


“Augustin Louis Cauchy, one of the greatest mathematicians of the XIX century, and maybe of 
History. Cauchy was one of the precursors of Mathematical Analysis, an extremely important area 
of higher Mathematics. He also has his name attached to several important results in Differential 
Equations and Mathematical Physics. 
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Example 5.14 Let a, b and c be given real numbers. Show that the system of 
equations 


3? +yY+27%+e074+bh+c =6 
ax+ by+cz=2 


doesn’t have any real solutions x, y, z. 


Proof By contradiction, if there existed a real solution x, y,z, we would have, by 
Cauchy’s inequality, 


4=(axtbytey?<(@4+PV4 PC +y4+2). 
Thus, letting uw = a* +b? +c? andv = x*+y*+2’, we would have u+ 3v = 6 and 
uv => 4. At this point, the inequality on the arithmetic and geometric means would 
give us 


6 =ut 3u > 2Vu- 30 = 2V3u0 = 2V3-4 = 4V3, 


which is an absurd. oO 


Example 5.15 (Romania) Let x1,x2,...,Xn+41 be positive reals satisfying x) + x2 + 
+++ +X, = Xn41. Prove that 


Vx On41 — x1) eee + Xn(X%n+1 —Xn) X< 
S V%n41 An41 — Xi) He + Ang Ont1 = An). 
Proof For 1 <j <n, let yj = x41 —x;. By Cauchy’s inequality, we have 


Jay tet nn Seb Fin Fo Fn 


= Sent Ont — x1) +++ + Ont1 — Xn). 


Oo 
For future use, we collect the following corollary of Cauchy’s inequality. 
Corollary 5.16 Given real numbers a,,...,d, and b,..., by, we have 
(5.10) 
with equality holding if and only if ay, ..., a, and by, ..., by are positively 


proportional, i.e., if and only if there exists a positive real number 4 such that 
a; = Ab; for\ <i<n. 
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Proof For the sake of clarity, let’s prove the corollary for n = 3, the general case 
being totally analogous. Since both sides of (5.10) are nonnegative real numbers, 
it suffices to show that the square of the left hand side is less than or equal to the 
square of the right hand side. In symbols, 


2 
(a, + bi)? + (ay +b)? + (a3 +3)? < (vai t+aztazt+ Voi +b5+ i) : 


Expanding (a; + b;)’, it follows that the square of the left hand side equals 
(ay + 2ayb, + bt) + (a3 + 2arbz + b3) + (aj + 2azb3 + b3). 


Analogously, the square of the right hand side equals 


(afta ta)+2wWae+aGa+a/b+bh+b3+ 674+ 55 +54). 


Cancelling the summand (ay + a; + a3) + (bj + bs + b3) from both sides, we 
conclude that (5.10) is equivalent to 


2(a,b, + anb2 + a3b3) < 2Vat + as + ay bi + bs + b?, 


which is precisely Cauchy’s inequality. 
The analysis of the conditions for equality will be left as an exercise to the reader. 
oO 


As will be seen in Chaps.8 and 13 of [4], form = 2 and 3 inequality (5.10) 
has the following geometric interpretation: in a cartesian coordinate system, if O = 
(0,...,0), A = (a),...,@,) and B = (b),...,b,), and we set C = A+B, then 
C= (a, + bj,...,a, + b,) and (5.10) amounts to the inequality 


C < OA+ OB, 


where XY stands for the length of the line segment XY. This is the same as the 
(geometric) triangle inequality for the (possibly degenerated) triangle OAC, and for 
this reason (5.10) is also known as the triangle inequality. 

Note also that, if nm = 1 and we make a; = a and b, = b, then (5.10) reduces to 
V(a +b)? < Va? +~/b? or, which is the same, |a+b| < |a|+|b]. This is the reason 
why (5.10), as well as its generalization (4.6), are known as triangle inequality. 
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Problems: Section 5.2 


1: 


2: 
3. 


Given real numbers x and y such that 3x + 4y = 12, compute the minimum 
possible value of x? + y’. 

Prove the general case (5.9) of Cauchy’s inequality. 

Given positive reals aj, a2, ..., dy, we define its quadratic mean as the number 


at+ate +a 


Prove the inequality between the quadratic and arithmetic means: 


[ane eee (5.11) 
n = ia 


with equality if and only if aj = ay =--- = dy. 


. (IMO shortlist) Let a), a2, a3, a4 be positive reals. Prove that 


2 2 2 
a; +; + a 


aj + aj + ag 


>= a, + a2 + 434+ a4, 


1<i<j<k<4 


with equality if and only if aj = ay = a3 = ag. 


. Use Cauchy’s inequality to give a third proof of (5.7). 
. (Leningrad) Given positive reals a, b, c and d, prove that 


. (OIM) Let x, y and z be positive reals whose sum equals 3. Prove that 


379 < J2x+34 /2y4+34 V2z43 < 3V5. 


When does the right hand inequality become an equality? 


. Letn > 2 be an integer. Prove that 


. (Soviet Union—adapted) Given x, y, z > 0, apply Cauchy’s inequality to prove 
that 
re yp 2 y Z 
ac Ta Sia Wks Gee sa 
z+ 22 + 4 ey z 


with equality if and only if x = y = z. 
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10. (APMO) Let aj, a2, ..., a, and bj, bo, ..., b, be given positive reals, such that 
ay tant-:- +a, = bd; + bo +--+» +b,. Show that 


11. (TT) Let a), ado, ..., a, be given positive reals. Prove that 


a a a 
(1+) (1+ 2)...(14 “) > (1+ a1)(1 + a)... (1+ ay). 
a2 a3 a\ 


12. (APMO) Let a, b and c be the lengths of the sides of a triangle. Show that 
Vatb—c+VJb+c-at+ve+a—b< Jat Vb + Je 


and explain when equality occurs. 
13. Let ay, az, ..., dn, D1, bo, ..., by and c), C2, ..., Cy be given positive real 
numbers. Show that 


(Son) 5) 5) 9) 


14. (IMO) Let a, b and c be positive reals such that abc = 1. Prove that 


1 1 1 


3 
——— a 
a(b+c) ~ b3(a+c) v C(at+b)~ 2 


5.3. More on Inequalities 


This section is devoted to the study of other important elementary inequalities. The 
first of them is generally attributed to the Bernoulli brothers,° being known as Ber- 
noulli’s inequality. In spite of its simplicity, we will see that it is quite useful in 
applications. 


Proposition 5.17 (Bernoulli) fn is a natural number and x > —1 is a real number, 
then 


(1 +x)" >1+nx, 


with equality holding for n > | if and only if x = 0. 


5Jacob and Johann Bernoulli, Swiss mathematicians of the XVIII century. 
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Proof Let’s make induction on n, the case n = 1 being immediate. Suppose, by 
induction hypothesis, that (1 + xk > 1+ kx; since 1 + x > 0, we have 


(+x! =(1+xy( +x > (14+ x)(1 +k) 
=1+(k+lxtkhe >1+(k+ Dx, 
with equality if and only if (1 +x)* = 1+kx and kx” = 0, ie., if and only if x = 0. 
Oo 


Example 5.18 Given a natural number n and positive reals a and b, show that 


n b\" 
(1+=) +(1+2) ca a 
b a 


with equality if and only if a = b. 


Proof Dividing both sides of the inequality by 2”, we see that it suffices to prove 


that 
: 1 he a n a ; 1 in b n 7 
2 2b 2 jy 7~~ 


Since —4 + >-—land —} + z > —1, if we apply Bernoulli’s inequality to both 
summands at the left hand side above and add the results, we get 


‘ es are 6 eae es Ga 2 i 
2° 2b 2° Gq) —" "\ ob fa) 
Now, it suffices to apply the inequality between the arithmetic and geometric means 
to obtain 


ob 


with equality if and only if > = 5-, i.e., if and only ifa = b. Oo 


We continue by presenting an inequality known in the literature as Chebyshev’s 
inequality.° 


Theorem 5.19 (Chebyshev) I/f a), daz, ..., d, and b, bo, ..., by are real numbers 
such that 


@, Sa) <--> Sa, and bi by <---> < dy, 


® After Pafnuty Chebyshev, Russian mathematician of the XIX century. 


130 5 Elementary Inequalities 


then 


i=1 


1 n 1 n 1 n 
(: Ys) (; 2 ] < 7 Dai 


with equality holding if and only if ay = az = +++ = a, ou by = by = +++ = Dy. 


Proof We have to show that 


nSan- (Sa) (Som) 20 


i=1 


and for this it suffices to observe that the expression at the left hand side equals 


> (a; — aj)(bi — Bj), (5.12) 


ij=l 


which, in turn, is nonnegative (for, the a,;’s and b;’s are equally ordered). 

Now, if a) = a2 =--- = a, or bj = bp = --- = by, it is immediate to check that 
equality holds in Chebyshev’s inequality. Conversely, suppose that equality holds 
in such an inequality. Then, the argument of the previous paragraph assures that the 
expression in (5.12) must be equal to zero. Since (a; —a,)(bi—b;) = 0 for all indices 
1 < i,j <n, we must have (a; — aj)(b; — bj) = 0 for all 1 < i,j < n. If there exists 
1 < k < nsuch that by < by4), then Db} < +--+ < De < Deey < +++ < by, and the 
condition (a;—ax41)(bi—bxe41) = 0 for every 1 < i < n gives a; = ay4, forl <i< 


k. Hence, a, = a2 = --- = ag = ay+1. If we now start from (a; — ax)(b; — by) = 0 
for k < i <n, we conclude in a similar way that ay; = +--+ = ay. Therefore, all of 
the a;’s must be equal. Oo 


The following corollary collects an important consequence of Chebyshev’s 
inequality. 


Corollary 5.20 [fk is a natural number and aj, a2, ..., dy are positive reals, then 


(5.13) 


n 


a +aztetay (ater sa) 
ee 


Moreover, if k > 1, then equality holds if and only if all of the a;’s are equal. 


Proof Let’s prove the inequality by induction on k > 1, noting that (5.13) is trivially 


true for k = | and all positive reals a,, ..., a,. By induction hypothesis, let / > 1 be 
a natural number such that (5.13) is true for k = / — | and all positive reals a1, ao, 
wey Ape 


Since both sides of the inequality we wish to prove are invariant under permu- 
tations of the indices 1, 2, ..., m, we can suppose, without any loss of generality, 
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that a) < a) < +++ < ay. Then, a)! < aS! <.--- < at, and it follows from 
Chebyshev’ inequality that 


1 n i 1 n 1 n i 
= S(2 > ayl= ees 5.14 
ss = bee (:4) (; =i : 


On the other hand, induction hypothesis gives 


I-1 
1 n icp 1 n 
a eal lee i ‘ 5.15 
pee (GE) 


and if we combine these two inequalities we get 


I-1 i 
1 n 1 n 1 n l n 
eae (eVEe) GE) 


i=1 


Finally, let 7 > 1 and suppose we have equality in (5.13) when k = J. Then, 
the argument of the previous paragraph assures that we must have equality in (5.14) 
and (5.15). However, by the condition for equality in Chebyshev’s inequality, the 
only way to have equality in (5.14) is having ay = --- = dp. oO 


The two coming examples illustrate how one can apply Chebyshev’s inequality. 


Example 5.21 (Poland) Let aj, a2,...,@, be positive reals with sum equal to s. 
Prove that 
a, a2 an n 


+ peed >—. 
S—-a S—a2 S— dy n—-1 


Proof Suppose, without loss of generality, that a; < a2 < ++: < a,. Then, s — 
a, > s—ay > +++ > §—a, and, since s—a; > O for every i, it follows that 
<  <...< —. Therefore, Chebyshev’s inequality gives 


Say" 


n a; n 1 1 n n 1 
— ve a . 


= i=1 


(5.16) 


On the other hand, it follows from Corollary 5.8 that 


n n 1 
(s6 _ «) (>: 2 =) on 
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However, since }7_, (s — a;) = (n — 1)s, this last inequality yields 


n 1 2: 


n 
> : 5.17 

dca * Gs ( ) 

If we now combine (5.16) and (5.17), we arrive at the desired inequality. oO 

Example 5.22 (Turkey) Letn > 1 be a natural number and x1, x2,...,X, be positive 


reals such that }77_, x7 = 1. Find the least possible value of 


a 

i=] “1 Se oe ec 

where the hat over x; at the denominator of the i-th summand indicates that it 
contains all of x), x2,...,Xn, except xj. 


Proof Let s denote the sum of the x;’s and suppose, without loss of generality, that 
Xp SXy S++) Sy. Then, x} <5 <---< and 4 << cH. 


The expression to be minimized can be written as 


Hence, if we apply Chebyshev’s inequality twice, together with the inequality 
between the arithmetic and quadratic means (cf. Problem 3, page 127) and 
Corollary 5.8, we get 


IV 
S| 
4 ‘ 
uJ = 

mA 
| — 
s 
———— 
4 > 
ii M = 
s 
LE 
< ‘ 
uJ = 
2h 
cea” 
i) 


IV 
5, 
Pp, 
Ly 
an 
[hs 
s 
——— 
IV 
5, a 
= 
N 
a ih 
ll = 
as 
“A 
| 
s 
— 
— 
| 
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In order to guarantee that WELD is the least possible value, we ought to show 
that it is attained. To this end, it is enough to see that, in view of the constraint 
>=1+7 = 1, the condition for equality in Chebyshev’ inequality gives x) = x. = 

ee : : ‘ 
Se if equality holds. However, computing with these values for x, 
..+5Xn, We See that all of the inequalities above become equalities, so that Pach 1S, 


indeed, the least possible value. oO 


Next, we present an inequality known as the rearrangement inequality. We 
recall that a sequence (x;,x2,...,X,) iS a permutation or a rearrangement of 
(a\,42,...,d,) if these two sequences differ only by the order of their terms.’ 


Proposition 5.23 Let aj < dz < +++ < a, be given positive real numbers. If 
(x1, X2,...,Xn) is any permutation of (a), d2,...,4n), then 


n—1 n—1 n—1 
i= i= i= 


with equality in the left (resp. right) inequality above if and only if x; = ay—j (resp. 
x; = aj), forl <i<n. 


Proof Letus show how to maximize the sum a,x; +d2x2+:+++4,X%,. (The argument 
to minimize it is completely analogous.) 

Since the number of permutations® (X1,%2,..-,Xn) Of (a1, 42,...,@y) is finite, 
there is at least one of them which maximizes the sum a,x; + dx. +--+ + GpXp. 

If (bj, b2,..., bn) is such a permutation, we want to show that b; = a; for 
1 <i<n.To this end, it suffices to show that b} < bz < --- < b,. By contradiction, 
suppose that there exist indices 1 < i < j < n for which b; > bj. Define a 
permutation (b/, b5,...,b/,) of the a;’s by setting 


by, se k¥#i,j 
b= 4b; se k=j 


bj, se k=i 
Then, 
n n 
ST a;b' = a ajb; = (ab; =F ajb') = (ajb; + a;bj) 
i=1 i=1 
= (ajb; + ajbi) _ (a;b; + ajbj) 
= (aj _ aj) (bj _ bi) > 0. 
7In the more precise language of functions (cf. Chap.6), we say that (x,,x2,...,%,) is a 
permutation of (a),a2,..., Gn) if there is a bijection g : {1,..., n} —> {1,..., n}, so that 


Xj =dgiy,forl <i<n. 
8 Actually, it is easy to show that there are exactly n! such permutations. For a proof, see [5], or 
provide one yourself, by making an induction argument. 
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This is the same as 


abi an arb a ee and’, > ayby + agbp +++ + ayby, 


which, in turn, contradicts the fact that (b1,b2,...,bn) is a permutation of 
(a), d2,...,@,) that maximizes the sum a,x; + a2x2 + +++ + GyXxy. Therefore, 
by < bp < +++ < dy. oO 
With essentially the same argument as above, one can easily extend the rear- 
rangement inequality to the case in which a; < a) < --- < ay. In this case, if 
(X1,%2,...,X,) is any permutation of (a), a2,..., 4»), then 
n—-1 n—-1 n—-1 


i=1 i=1 i=1 


with equality in the left (resp. right) inequality above if (and no more if, and only 
if) X; = An—i (resp. x; = a;), for 1 < i < n. The coming example explores this more 
general form of the rearrangement inequality. 

Example 5.24 Given positive reals a, b and c, show that: 

(a) C+ R40 >0b+b'c4+ ca. 

wy ec bebe dh, 


Proof 


(a) Suppose, without loss of generality, thata < b < c. (Other orderings of a, 
b and c would give rise to analogous arguments.) A direct application of the 
rearrangement inequality gives 


CPAP Se CEO be eS a bE et Ore 


(b) The symmetry of both sides with respect to a, b and c allows us to suppose again 
that a < b < c. The inequality to be proved is equivalent to 


a’be + ab*c + abc” < (ab)” + (be)? + (ca). 
In order to get this, let’s first observe that the condition 0 < a < b < c implies 
ab < ac < bc. Therefore, upon applying the general form of the rearrangement 


inequality, we obtain 


abc + ab*c + abc? = ab-ac + ab- be +.ac-be 
< (ab)* + (bc)? + (ca)’. 
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A very useful idea in certain types of problems involving inequalities is to try to 
use arguments similar to the one of the proof of the rearrangement inequality. Let’s 
see an example along these lines. 


Example 5.25 (Taiwan) Let n > 2 be an integer. Compute the greatest possible 
value of the expression 


D> xxii + %), 


l<i<j<n 


when (x1, X2,...,Xn) Varies over all sequences of nonnegative reals such that x; + 
Xa: +x, = 1. 


Solution Let 


En(%1,.--,Xn) = yy XjxXj(Xi + 2) 


l<i<j<n 
and observe that the symmetry of the right hand side allows us to suppose that 
4 SP Sy = 0. 
Let’s first of all show that 
En(%1,.+-,Xn—2>Xn—1 + Xn, 0) > En (1, ... Xn). (5.18) 


To this end, set yj = x; for] <j <n—1, y,-1 = X,-1 +X, and y, = O, and note 
that y) + yo +-:++ y, = 1. For the sake of notation, let AE,, denote the difference 


AE, = En(y1,---5¥n) — EnQ1,--+5%Xn)- 


Then, we get 
n—2 
AE, = Y_ yi ty) + Do yin-107 + Yn) 
l<i<j<n—2 i=1 


n—1 
+ >> yoni + yn) — y XiXj(Xi + Xj) 
i=l 


l<i<j<n—2 


_ 2 XiXn—1 (Xj + Xn—1) — se XiXn (2%; + Xn). 


l<i<j<n—2 l<i<j<n-1 
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Taking the definition of the y;’s into account, we obtain 


n—2 
AE, = SiGe + Xn) (x; + Xn-1 + Xn) 
i=1 


n—2 n—-1 


= S02 xitn—1 (i + Xn—1) — D> xixn (Ai + Xn) 


i=1 i=1 
n—2 
= So biGn—-1 a Xn) (Xj + Xn-1 + Xn) — XiXn-1 (x; + Xn-1) 
i=1 
= XiXp (Xi =F Xn) = Xn—1Xn(Xn-1 + Xn) 


n—2 
= ) 2XiXn 1Xn — Xn 1Xn(Xn 1+ Xn) 


i=1 


_ Xn—1Xn (2X1 fees + 2p 2 — Xn-2 — Xn). 
Since xj > x2 >-++ > xX, => 0 andn = 3, it perspires that 


2x1 = ai 2Xn—2 —Xp-1 — Xn = 2x1 —Xn-1 — Xn 


= X1 —Xp-1 +X —Xy, = O, 


which, in turn, establishes (5.18). 

Thus, in order to maximize E,,(x,,...,X,), we can restrict our attention to the 
sequences (x), x2,...,X,) of nonnegative reals for which x, + +--+ X,-1 + %, = 1 
and x, = 0. In this case, it is immediate to see that 


En (1, -.+5%n—1,9) = En-1 (01, .--Xn-1)s 


with x1,...,X,—1 nonnegative reals for which x; +--+: + x,-1; = 1. Then, we can 
repeat the same argument and, doing this several times, we conclude that we can 
suppose x; = 0 fori > 2. Therefore, it is sufficient to maximize 


Ey(x1,X2) = x1X2(%1 + x2), 


for nonnegative x; and x2 satisfying x; + x2 = 1. This, in turn, is immediate: 


2 
+ 1 
X XQ (x1 + X2) = x1XQ < (=) = ri 
Finally, we have 
E,( )<E € :) = 1 
n\X1,--+,Xn) S £2 2 ~ 4 
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The last inequality we wish to consider is due to the XIX century Norwegian 
mathematician N. H. Abel. For this reason, it is known as Abel’s inequality. 


Theorem 5.26 (Abel) Letn > 1 be a natural number and ay, do, ..., An, b4, bo, 
..., Dy, be given real numbers, with a, = dy = +++ = ay = 0. If M and m respectively 
denote the maximum and minimum elements of the set of sums {b,,b, +b2,...,b, + 
by +---+ by}, then 


ma, < ayby + agbo +--+ + dnb, < May. 


Proof Let’s prove the right hand side inequality, the proof of the left hand side one 
being totally analogous. 
Make so = 0 and s; = b} +---+ 5; for 1 < i <n. Then, 


n—-1 


n n n 
> abi = > aj(s; — si) = 5 aj Sj — ) Qj+18i 
i=1 i=1 i=1 i=0 


n—1 
= SG = Git 1)Si + AnSn 
i=1 
n—-1 
< YS M(ai — aj41) + May, = May. 
i=1 oO 


For future reference, and in the notations of the proof of Abel’s theorem, the 
identity 


n n—1| 
Yo aids = YG = aig1)8i + an5y (5.19) 
i=1 i=1 


is usually referred to as Abel’s identity. As we shall see here and later (see 
Problem 18, page 243), it is almost as useful as the inequality itself. 

We finish this section by presenting a beautiful application of Abel’s inequality. 
In the course of the proof, we will use the fact that every set with k elements 
has exactly 2 distinct subsets. For a proof of this last fact, we refer the reader to 
Problem 12, page 96, or to Chap. 1 of [5]. 


Example 5.27 (Romania) Do the following items: 


(a) Letn > 1 be an integer and x),...,X, y1,...,¥n positive reals such that x,y, < 
X2y2 <+++ < x,y, and, forl <k <n, xj +-++ +x, > yi +°++ + yg. Prove that 


1 1 1 1 1 1 
— ee ee Se a 
Xx] X2 Xn v1 y2 Yn 
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(b) Let A = {aj,a2,...,Gn} be a set of positive integers with the following 
property: the sums of the elements of any two nonempty subsets of A are always 
distinct. Prove that 


1 1 
— ep OD 
a\ a2 an 
Proof 
(a) First of all, observe that 
31 Gl Oxy 
yi --y eye (5.20) 
m1 et i 


On the other hand, the condition x; +---+2%% >yi +-::-+y,forl<k<n 
can be rewritten as 


(x) —y1) +--+ + Ox — yx) = 0 
for 1 < k < n. Thus, making a; = = and b; = x; — y; for 1 < i < n, we have 


a >a>°+:>a,>0,b,4+:: ee, > Oand = = ajbj, for 1 <i <n. 
Appling Abel’s inequality to (5.20), we obtain 


n 1 n 1 
S> == So = = ay minfd, +--+ + Bi; 1<i<n}>0. 
| Yi ay Xi 

i=1 i=1 


(b) Suppose, without loss of generality, that aj < ay < -+: < d,, and let By = 
{a,,..., ax} for 1 < k <n. The hypothesis on the set A assures that all of the 
2* — 1 nonempty subsets of B; have distinct sums of elements. However, since 
each of these sums is a natural number and a; +---+ a, is the greatest of them, 
we conclude that 
A So 
Now, observing that 2 — 1 = 29+ 2! + --- 4+ 2'"!, we have 
Ay tant tay, > 2°42) 4---+ 2h! 


for 1 < k <n. On the other hand, it is obvious that 


2a, <2!ay <--- <2” a, 
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so that the inequality of item (a) gives 


a ee ee ee eae ee eee, 
a\ a2 an 29 2! Qn-l 


For another proof of item (b) of the previous example, see Chap. 3 of [5]. 


Problems: Section 5.3 


1. Given n € N, prove that (1 + +)" < (1+ a ae 
2. (USA) Given naturals m and n, let a = ey Prove that 


q” + a" > m”™ + n". 
3. Let a, b and c be positive reals. Prove that 


1 1 1 &4+584+c8 
as = < 


a bc a@b3c3 


4. (OIM) Find all real positive solutions of the system of equations 


ene aan aie 1994 
Xf + XQ +++ + Xig0g = XT HGH + + Xh004 | 


5. (IMO shortlist) Let a, a2, a3, a4 be positive reals. Prove that 


3 3 3 
a; + G4; + a 


2, 2 2, 2, 
2 aj +a,+ 434+ 44, 
ai + aj + ag 


l<i<j<k<4 


with equality if and only if aj = ay = a3 = ag. 

6. Letn > 1 be an integer and qj, dz, ..., An, D1, bo, ..., by, be given real numbers, 
with aj < ay <-+: < a,andb; < bo < ++: < by. IA, < An < ++: <A, are 
positive reals whose sum is equal to 1, prove that 


(x 2a (x: 2 < 3 Ajab; 
i=l i=l i=l 


and give necessary and sufficient conditions for the equality. To which particular 
case there corresponds Chebyshev’s inequality? 
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7. (IMO shortlist) Let a, b, c and d be nonnegative reals for which ab + be + cd + 
da = |. Prove that 


a 4 b a: 3 4 bB ! 
b+c+td atct+d atb+d at+b+ce 73 


8. For positive reals a, b, c andn € N, prove that 


n nA 


a Cc - qs! + pr} +c" 


+ te 
b+ce atc atb™ 2, 


9. (IMO shortlist) Let x, y and z be positive reals such that xyz = 1. Prove that 


x y? bia 3 
Me ee 
d+yd+z d+xd+y) (U4+xU+y)7 4 

10. (India) Let n > 1 be an integer and x), x2, ..., x, be given positive reals whose 
sum is equal to 1. Prove that 
i Xj n 1 is 
>,/—— > Xj. 
2a Vit a 
11. (Slovenia) Given 2n positive reals a), az, ..., dan, how should we arrange them 
in pairs, such that the sum of the ” products of the numbers of each pair is 


maximal? 
12. (IMO) Let (a;)z>1 be a sequence of pairwise distinct positive integers. Prove 
that, for every n € N, we have 


13. Do the following items: 


(a) If x < yare given positive reals and a = aa prove that a(x + y—a) > xy. 


(b) Use item (a) to furnish another proof for the inequality between the 
arithmetic and geometric means. 


14. (TT) Let a), a2,..., a, be given positive reals. Prove that 
2 2 2 n 
(1+ “1) (1+2)...(1+ <1) > T]a+ay. 
a2 a3 a kel 
15. (Taiwan) Let n > 3 be an integer and xj, x2, ..., x, be nonnegative reals whose 


sum equals 1. Prove that 


4 
xix? + 2x5x3 + x34 + eb xy s a7 


5.3 


16. 


17. 
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If a, b, c and d are nonnegative reals such thata < l,a+b<5,a+b+c < 14 
anda+b-+c+d < 30, use Abel’s inequality to prove that 


Jat+ Vb+ Je+ Vd < 10. 


., by be real numbers such that aj > a2 > +--+: >a, >0 


Let aj,..., dy, and bj,.. 
., bjb2...bn > ayd2... An. Show that 


and b; > ay, bjbz > ajap,.. 


by + bp te + bn > a tanto + an. 


Chapter 6 
The Concept of Function 


With the algebraic background of the previous chapters at our disposal, we now 
begin the study of real functions of a single variable. After presenting the basic 
concepts of domain, codomain and image, some relevant examples of functions 
are given. Then, we introduce the notions of monotonicity and extremal values, 
discussing various examples that, in spite of their elementary character, will reveal 
themselves to be very useful. The chapter continues with the study of the operations 
of composition and inversion of functions, and culminates with the orchestration of 
the whole of it to a first study of implicitly defined functions. We end the chapter by 
discussing, in its last two sections, graphs of elementary functions. 


6.1 Definitions and Examples 


Let X and Y be two given nonempty sets. Informally, a function f from X to Y is a 
rule that associates to each x € X a unique y € Y. It is sometimes useful to visualize 
a function f : X — Y in a more concrete way, by means of diagrams as that of 
Fig. 6.1, where each arrow indicates which element y € Y is associated to a given 
xeEXx., 

We write f : X — Y to denote that f is a function from X to Y. In this case, 
the element y € Y associated to x € X via f is denoted by y = f(x), and is called 
the image of x € X by f. In the example of Fig. 6.1, we have X = {1,2, 3}, 
Y = {a,b,c,d} andf(1) = a, f(2) = a, f(3) = c. Thus, a is the image of 1 and 2 
by f, and c is the image of 3 by f. 

The discussion of the previous paragraph makes it clear that the definition of 
function allows, in the corresponding diagram, that one or more elements of Y do 
not receive arrows, or that one or more elements of Y receive more than one arrow 
(observe that both these possibilities are present in Fig. 6.1). 
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Fig. 6.1 Example of a xX Y 
function f from X to Y 

f 
Fig. 6.2 No arrow starts at xX Y 
lex 
Fig. 6.3 More than one xX Y 


arrow starts at 1 € X 


Note, however, that the diagrams in Fig. 6.2 and 6.3 do not correspond to 
functions. The situation of Fig. 6.2 is forbidden because there is no arrow departing 
from | € X. The situation of Fig. 6.3 is forbidden because more than one arrows 
departs from | € X. 

The next three definitions isolate some quite useful types of functions. 


Definition 6.1 Let X and Y be given nonempty sets. For a fixed element c € Y, the 
constant function c from X to Y is the function f : X — Y such that f(x) = c for 
every x € X. 
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Thus, in the extreme case of the constant function equal to c, defined as above, 
every x € X is associated to a single y € Y, namely, y = c. Nevertheless, the 
conditions required by the “definition” of function are fully satisfied, i.e., every x € 
X is associated toa unique y € Y. 


Definition 6.2 Let X be a given nonempty set. The identity function of X is the 
function Idy : X — X, such that Idy(x) = x, for every x € X. 


As in the previous definition, it is immediate to see that the conditions required 
by the “definition” of function are satisfied, so that Idy is indeed a function from X 
to itself. 

For the next definition, given an arbitrary n € N, we let J, be the set whose 
elements are the first n natural numbers, ie., J; = {1}, b = {1,2}, 5 = {1,2,3} 
and, more generally, 


I, ={kEN, 1 <k <n}. 


Definition 6.3 An infinite sequence of real numbers is a function f : N—-> R.A 
finite sequence of real numbers is a function f : J, > R, for some n € N. 


As was anticipated in Chapter 3, given a sequence f : N > R (resp. f : J, ~ R) 
andk € N (resp. k € J,), one uses to write ax, instead of f(k), to denote the image of 
k by f. In this case, one also writes (a,)x>1 (resp. (ax) 1<¢<n) to denote the sequence 
as a whole, and says that a, is its k -th term. 

From a mathematically rigorous viewpoint, a function is a particular type of 
relation, according to Definition 6.4 below. In order to state it properly, we begin by 
recalling two simple facts from elementary set theory’. 

Given nonempty sets X and Y, and elements x € X and y € Y, the ordered pair 
(x, y) is defined by 


(x,y) = {{x}, &% y}}. 
From this, it is immediate to prove that, if x,x’ € X andy, y’ € Y, then 
(x.y) =.) &@x=x' and y=y’. 
The cartesian product of X and Y (in this order), denoted X x Y, is the set 
Xx Y= {(x,y); x € X and ye ¥}. 
Definition 6.4 Given nonempty sets X and Y, a relation from X to Y (or between 
X and Y, in this order) is a subset of the cartesian product X x Y, i.e., R is a set of 


ordered pairs (x, y), with x € X andy € Y. If ® is a relation from X to X, we simply 
say that 7e is a relation on X. 


‘For a more thorough account of set theory, we refer the reader to the classic [15]. 
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Example 6.5 Let X = {1,2,3} and Y = {2,3,4, 5}. The set 
R = {(x,y) €XxY;x> y} 


is a relation from X to Y, given by R = {(2, 2), (3, 2), (3, 3)}. Indeed, these are the 
only ordered pairs (x, y), with x € {1,2,3}, y € {2,3, 4, 5} and such that x > y. 


The example above illustrates an obvious procedure we can use to define a 
specific relation R between given nonempty sets X and Y (in this order): it suffices 
to declare, somehow, a subset of the cartesian product X x Y; those ordered pairs of 
X x Y that satisfy the given prescription will be precisely the elements of R. 

If R is a relation from X to Y, then R C X x Y, by definition. Conversely, once 
we choose an ordered pair (x, y) € X x Y, then either (x, y) € R or (x,y) € R. In 
the former case, we say that x and y are related by R, and write x Fy; in the latter 
case, we say that x and y are not related by R, and write xX y. Thus, in symbols, 


xRyS w&YER. (6.1) 


Therefore, with respect to the relation of Example 6.5, we have 3R2 but 2%3, 
since 2 > 3 is false. 

Among all types of relations we can consider, one of the most important ones is 
given by the coming definition. 


Definition 6.6 Given nonempty sets X and Y, arelation R from X to Y isa function 
if the following condition is satisfied: 


VxeX, dauniqueyeY; xRy. 


As in the beginning of this section, given nonempty sets X and Y, it is customary 
to use smallcase latin letters like f, g, h etc to denote functions from X to Y. Also 
as before, we write f : X — Y if f is such a function, and write f(x) = y if the 
ordered pair (x, y) € X x Y belongs tof, i.e., if it is such that xf y. Notice that such a 
notation makes sense, for the definition of function assures that, if (x, y,) and (x, y2) 
are ordered pairs in X x Y such that xf y, and xf y2, then y; = y2. On the other 
hand, a moment’s thought shows that the formal definition of function given above 
is simply the correct way of validating the informal definition given at the beginning 
of this section. 

We will usually work with functions f : X — Y such that X,Y C R. In these 
cases, we will generally indicate which element f(x) € Y is associated to a generic 
element x € X by means of a formula in x that ought to be seen as a rule to get 
f(x) out of x. For instance, we can say “consider the function f : R — R given 
by f(x) = x?” to mean that the function f associates, to each x € R, its square x’. 
Observe that the defining requisites of a function are fulfilled, for, to each x € R, 
we have associated a single f(x) € R, namely, x’. In particular, yet with respect to 
this example, we have f(/2) = (2)? = 2, f(3) = 37 = 9 ete. 
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When X,Y C Randf : X — Y isa function such that the element f(x) € Y 
associated to x € X is given by a formula in x, we will sometimes denote such a 
correspondence by writing 


f:X— Y 
xr fx) 


This way, the function of the previous paragraph, that associates to each x € R its 


square x”, could have been denoted in the following manner: 


f:R—OR 
xR x) 


Let’s see another example. 


Example 6.7 Consider the function f : Q > R, given by 


Vx2+1, ifx <0 
f(x) = ’ 
x+1, ifx>0 
We surely have defined a function, for the expressions that define f(x) have sense 
in R and, although we must apply different formulas, according to the rational x 
satisfies x < 0 or x > 0, each rational x has a single, well defined image f(x). Thus, 
for example, f(—1) = /(—1l)? +1 = V2 (since —1 < 0), but f(2) = 2+1 =3 
(since 2 > 0). 

Observe that we could have defined f(x) by writing 


Vx2+1, ifx <0 
fx) = 
x+1, ifx>0 
If that were the case, then, albeit conditions x < 0 and x => 0 cover all rationals, they 
wouldn’t be mutually exclusive: x = 0 would satisfy both. However, the formulas 
that should be applied in one case or the other would give the same result for x = 0 
(since V0? + 1 = 0 + 1), thus avoiding any possibility of inconsistence. 


We sometimes say that a function f like that of the previous example is defined 
by parts. This expression alludes to the fact that there is a formula to compute 
f(x) when x € (—oo, 0], and another one to compute it when x € (0, +00) (or 
x € [0, +00), as one may wish). 

When dealing with a function f : X — Y, it’s frequently useful to refer to the sets 
X and Y as the domain and codomain of the function, respectively; in this context, 
we will write X = Dom (f). For instance, for the function f of Example 6.7, the 
domain and codomain are, respectively, Q and R. 

We will often work with functions f : X — R, with X C R. In cases like 
these, we shall say that f is a real function (alluding to the fact that it assumes real 
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values —1.e., that its codomain is R) of a real variable (now alluding to the fact 
that a generic element x of the domain X of f — the variable of the function — is a 
real number). 

In cases like those of the previous paragraph, if f(x) is defined by a formula on 
x, then, unless explicitly stated otherwise, we will stick to the usage of taking X as 
the largest possible domain. In other words, in such cases we will take X to be the 
largest possible subset of R in which the mathematical operations that define the 
expression f(x) have sense. We shall, then, say that X is the maximal domain of 
definition, or simply the maximal domain of f. 


Example 6.8 Find the maximal domain of definition X C R of the function f : X > 


: ees 
R, given by f(x) = wc 
. 1 ayy : = 
Solidon For chi to be a real number, it’s necessary and sufficient that x(x — 1) 
be positive. Thus, 
: 1 
x ={rer, —1_ er! 
Vx(x — 1) 


= {x € R; x(x— 1) > O}. 


A product of two real numbers is positive only if both factors have the same sign. 
Therefore, we must have x > 0 and x — 1 > 0, or else x < 0 and x — 1 < 0. Hence, 
we must have x > 1 or x < 0, so that 


X = (—00, 0) U (1, +00). = 

In the context of real functions of a real variable, there are standard ways of 
building new functions out of other known ones: we just have to use the arithmetic 
operations of their codomain R. More precisely, given a nonempty set X C R, a 
real number c and functions f, g : X — R (with the same domain!), we define the 
functions f + g,f-g,c:f:X — R by setting 


(f + g)(x) = f(%) + gQ), 
(f- g)(x%) = f(x) - g(), 
(c-f\@) =c-fQ), 


for every x € X. 

Some remarks are in order. First of all, notice that the addition signs in the first 
equality above have distinct meanings: in the left hand side, the + sign is used in 
the very definition of the function f + g, while in the right hand side f(x) + g(x) 
stands for the usual addition of the real numbers f(x) and g(x). Analogous remarks 
are valid for the multiplication signs used in the definitions of the functions f - g and 
c-f. Secondly, as is usual with real numbers, we shall generally omit multiplication 
signs, writing fg and cf instead of f- g and c-f. This usage will not be source of any 
kind of confusion. 
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It is evident that f + g, fg and cf are indeed functions from X to R. On the other 
hand, by analogy with real numbers, f + g and fg are called the sum and the product 
of f and g, respectively. Also, note that the product cf of the real number c by the 
function f can be seen as a particular case of the product of two functions: taking 
g : X — R to be the function identically equal to c, we have fg = cf; yet for such a 
g, we shall denote f + g simply by f + c, so that 


F+o@ =fa) +e, 


for every x € X. 


x 


Example 6.9 Let f and g be functions from R to R given by f(x) = zy, and 
g(x) = —x + 3, for every x € R. Then, 


(+ 80) =f) + 80) = = + r+) 
_ x + (x? + 1)(—x + 3) _ —x + 3x7 +3 
~ +1 ~ e+)? 

243 
(fe)(2) = FOI) = SA Cx +3) = 


and 


Xx xJ3 
(V3) (0) = V3f() = V3- sas 


We leave to the reader the task of verifying that the operations of addition and 
multiplication of functions, defined as above, satisfy properties analogous to those 
satisfied by the corresponding operations with real numbers. More precisely, for 
functions f, g, : X — R and real numbers a, b, we have: 


* Commutativity: f + g = g+4+f;fg = ef. 
+ Associativity: f + (g +h) = (f +g) +h; f(gh) = (fe)h; a(bf) = (abyf. 
¢ Distributivity: f(g + h) = fg + fh; (a+ b)f = af + Df. 


Finally, note that the associativity property of the operations of addition and 
multiplication of two functions allows us to define, in entirely analogous ways, the 
sum and the product of an arbitrary finite number of functions from X to R. For 
example, given functions f, g,h : X —> R, we define f + g +h: X > Ras either 
(f+g)+horf+(g+h), since these are equal functions; analogously, fgh : X > R 
can be defined as either (fg)h or f (gh). 

For an additional extension of the discussion of operations on functions, we refer 
the reader to Problem 8. 

Turning our attention to the codomain of a general function f : X — Y, it’s 
important for the reader to realize that Y doesn’t generally coincide with the set of 
the images of the elements of X (which, anyhow, is a subset of Y). Let’s illustrate this 
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point using again the function f of Example 6.7. We have already observed that its 
codomain is the set R of real numbers. On the other hand, it’s certain that its subset 
{f (x); x € Q} doesn’t contain real numbers less than 1. Indeed, since x* + 1 > 1 for 
any real x, we have 


fx) = VP 4+1>V1=1, 


whenever x < 0 is rational; on the other hand, for a rational x > 0, we have f(x) = 
x+1> 1. In any case, 


if (x); x € Q} C [1, +00), 


which is a proper subset of the codomain R of f. 
More generally, given a function f : X — Y, the image of f is the set Im/(f), 
whose elements are the images f(x) € Y of the elements x € X: 


Im(f) = {f() € Y; x € X}. 


In particular, we always have Im(f) C Y. 

In the example just discussed, we showed that the image of a function can be a 
proper subset of its codomain. Nevertheless, we didn’t get an explicit description of 
the image of the function under consideration. As further developments will show, 
in specific situations this can be a somewhat hard task. For the time being, let’s 
consider an illustrative example. 


Example 6.10 Find an explicit description of the image of the function f : Q\{0} > 
Q given by f(r) = i if the nonzero rational r is written as r = 7, witha € Z,be N 
and ged(a, b) = 1°. 


Solution The function f is well defined, in the sense that its definition is not 
ambiguous. Indeed, it’s a standard fact that every nonzero rational number admits 
a unique representation as the quotient of two relatively prime integers, the 
denominator being a natural number; for example, —2 = +, so that f (—2) = i. 
On the other hand, in the notations of the statement of the example, since b € N, we 


immediately see that 
1 111 
I {-;beN={1-25,.... 
mien ¢ ee 


Moreover, it is clear that all elements of the last set above do belong to the image of 
f, forf (+) = ts for every b € N. Hence, we conclude that 


?For further details on the gcd of two nonzero integers, read again the introduction to Chapter 1 or, 
alternatively, read Section 6.2 of [5]. 
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111 
Im(f) = {l.5.5.5.---}. . 

Unfortunately, there exists no algorithm that allows us to explicitly describe the 
image of an arbitrarily given function. Nevertheless, in subsequent chapters we will 
solve this problem for several important classes of functions. 

We finish this section discussing the important concept of equality of functions. 
With respect to the function f of Example 6.7, it makes no sense to consider f(/2), 
since /2 ¢ Q and the domain of f is Q. What we could do would be to consider, 
instead of f, the function g : R > R, given by 


toe Vx? +1, ifx<0 
x+1,ifx>0 . 
Although the formulas that define f(x) and g(x) are the same, for f they can be 
applied only to x € Q, while for g they can be applied to every real x. Therefore, it 
makes no sense to think about f and g as being equal functions, just denoted in two 
different ways. 

In the positive direction, we have the following 


Definition 6.11 Functions f : X — Y andg: W > Z are equal if X = W,Y=Z 
and f(x) = g(x), for every x € X. 


Iff : X > Y and g: W — Z are equal functions, we write f = g. We also stress 
that, according to the above definition, the equality of f and g doesn’t merely reduce 
to f(x) = g(x); it also means the equality X = W of the domains of f and g, as well 
as the equality Y = Z of the codomains of these two functions. If functions f and 
g as above are not equal, we will write f # g and say that f and g are different or 
distinct functions. 


Problems: Section 6.1 


1. Find the maximal domain of definition of the function f, such that f(x) = = 


i 


2. Find the maximal domain of definition of the function f, such that f(x) = 


\t-¥i- ve 


3. Let f : Q% > Qy be the function defined by f (4) = S—4!, ifa,b € N are 
relatively prime. 


(a) Compute f(1), (10) and f (2). 
101 


(b) Among the rationals 3, = and 5, which do belong to the image of f? 
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10. 


11. 


12. 
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. Consider the function f : R > R given by f(x) = x° — 2x” + 5x, for every 


x € R. Prove that f(x) has the same sign of x, for every real number x # 0. 


. Function f : R > R is such that f(1) = 2, f(./2) = 4 and f(x+ y) = ff), 


for every x, y € R. Compute the value of f(3 + V2). 


. Let f : R = R be a function such that f(x + y) = f(x) + f(), for every reals 


x and y. If (a;)z>1 is an AP of common difference r, prove that the sequence 
(f (ax))x>1 is an AP of common difference f(r). 


. Let f : R — R be a function such that f(x + y) = f(@)f(y), for every reals 


x and y. If (ax)x>1 is an AP of common difference r and such that f(a1) 4 0, 
prove that the sequence (f(a,))x>1 is a GP of common ratio f(r). 


. * Given a nonempty set X C R and functions f,g : X — R, extend the 


discussion in the text, furnishing adequate definitions to the difference f — g 
and the quotient £ of the functions f and g. 


. * The integer part of a real number x, denoted |x|, is defined to be the greatest 


integer which is less than or equal to x. For instance, || = 3, —3| = —2 and 
|1| = 1. Find the image of the integer part function 


||:R— R (6.2) 

xt > [x]? , 
that associates to each x € R its integer part |x|. 
* The fractional part of a real number x, denoted {x}, is defined as {x} = 
x — |x|, where |x| stands for the integer part of x (see the previous problem). 
For example, {7} = x — 3, {-3} = —3 —(-2) = 5 and {I} = 1-1=0. 
Find the image of the fractional part function 


{}:R—R . (6.3) 
x > {x} 

that associates to each x € R its fractional part {x}. 

(TT) Prove that the n—th natural number which is not a perfect square is equal 

to|n+ f/n+ s|, where |-| is defined as in Problem 9. 

*Letf : Q — Q bea function such that f(x+ y) = f(x) + f(y), for all x,y € Q. 

Prove the following identities, for all x, y € Q and m,n € Z, withn F 0: 


(a) f(0) = 0 and f(—x) = —f(). 
(b) f@—y) =f@) —f). 

(c) f(x) = mf (x). 
Oras. 

(e) f (7) = Ff). 
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We start this section concentrating ourselves in the problem of finding the image of 
a given function. To this end, recall that the image of a function f : X — Y is the set 


Im(f) = {f(x) € Y; x € X} 
= {y © Y; y= f(x) for some x € X}. 


This second way of declaring Im/(f) is particularly useful in case f is a real function 
of a real variable, i.e., if f : X > R, with X C R. Indeed, for such an f, if the values 
f(x) are given by a formula on x € X, we can look at the problem of finding the 
image of f as that of finding the y € R for which the equation f(x) = y has at least 
one solution x € X. Let’s see some examples. 


Example 6.12 An affine function is a function f : R — R such that f(x) = ax+b, 
for every real x, where a and b are given real numbers, with a # 0. A linear function 
is an affine function f as above, such that b = 0. 

According to the previous paragraph, the image of an affine function f as above 
can be found by searching for the y € R for which the equation ax + b = y has 
at least one solution x € R. Since a # 0, this equation always admits the solution 
x= — Therefore, we conclude that every y € R belongs to the image of f and, 
hence, Im(f) = R. 


Example 6.13 The function of inverse proportionality is the function f : R \ 
{0} > R \ {0} given by f(x) = 1, for every x € R \ {0}. 

In order to find its image, it suffices to find all y ¢ R for which there exists a 
real number x # 0 (i.e., x belonging to the domain of f), such that f(x) = y, ie., 
such that + = y. If y = 0, it is clear that this equation doesn’t admit solutions; on 
the other hand, if y 4 0, this same equation admits the solution x = i # 0. Hence, 


Im (f) = R \ {0}. 


It is time we define one of the most important classes of elementary real functions 
of a real variable. 


Definition 6.14 A quadratic, or second degree function is a function f : R > R 
such that f(x) = ax? + bx +c for every real x, where a, b and c are given real 
numbers, with a # 0. The discriminant A of f is the discriminant of the associated 
second degree trinomial ax? + bx + c, so that A = b? — 4ac. 


The problem of finding the image of a quadratic function is sufficiently important 
to be collected in the following 


Proposition 6.15 With respect to the quadratic function f(x) = ax” + bx +c, we 
have that: 


(a) Ifa > 0, then Im(f) = [—4, +00). 


(b) Ifa <0, then Im(f) = (—00,-4]. 


154 6 The Concept of Function 


Moreover, in any of the above cases, 


b 


A 
le ear aan 


Proof Following the general procedure described at the beginning of this section, 
it suffices to find all y € R for which the equation axrt+bx+ec= y, i.e., the 
second degree equation ax* + bx + (c — y) = 0, has at least one real solution. 
As we already know from Section 2.3, a necessary and sufficient condition for the 
existence of such (a) root(s) is that this last equation has nonnegative discriminant, 
i.e., that b? — 4a(c — y) = 0. Since we agreed to let b* — 4ac = A, the y’s we’re 
looking for are precisely the solutions of the first degree inequality 


A + 4ay => 0. 


Now, we separately consider the cases a > 0 anda < 0. If a > 0, then 
A 
4ay+ A>0Sy>-—, 
4a 
and it follows that 


Im (f) = peRy=-2 7 |-z +00): 
4 a 


a 4 
if a < 0, then 
A 
fJay+ A>0@y<-— 
4a 
and, hence, 
A A 
Im (f) = jy eR; y< —— 7 = [ -00,-—]. 
4a 4a 


To what was left to do, notice that, for y € Im(f), the solutions of the equation 
ax? + bx +c = y(@ ax? + bx+ (c—y) = 0) are 


—b+ /b?—4a(c—y) —b+ /A+ 4ay 
° — ee eee 


6.4 
2a 2a Of) 


Therefore, equation f(x) = y admits a single solution if and only if A + 4ay = 0, 
or, which is the same, if and only if y = —A; this being the case, we have from (6.4) 
that x = —2. oO 
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To what comes next, we make the convention of saying that the quadratic 
function f(x) = ax? + bx + c has constant sign if f(x) => 0 for every x € R, or 
f(x) < 0 for every x € R. 


Corollary 6.16 The quadratic function f(x) = ax? +bx+c has constant sign if and 
only if A < 0. In this case, we have af (x) = 0 for every x € R. In other words: 


(a) If A < Oanda > 0, then f(x) = 0 for everyx € R. 
(b) If A < Oanda < 0, then f(x) < 0 for everyx ER. 


Proof Let’s look at the case a > 0, the case a < 0 being totally analogous. If A < 0, 
it follows from the previous proposition that 


A 
fW2-7 729, VxeER. 
a 


Conversely, suppose that a > 0 and that f has constant sign. Again from the 
previous proposition, the image of f contains positive numbers, so that we must 
have f(x) = 0, for every x € R. In particular, 


Therefore, A < 0. oO 


Remark 6.17 A simple modification of the argument presented in the proof of the 
previous corollary allows us to conclude that 


i. If A <Oanda > 0, then f(x) > 0 forallx € R. 
ii. If A < Oanda < 0, then f(x) < 0 forallx ER. 


From now on, we shall use these results whenever needed, without further com- 
ments. 


The previous corollary can also be used to give a much simpler proof of Cauchy’s 
inequality (cf. Theorem 5.13). 


Example 6.18 Let n > 1 be an integer and aj, dz, ..., dn, bi, bo, ..., by be real 
numbers, such that at least one of the a;’s and at least one of the b;’s is nonzero. 
Consider the quadratic function 


FQ) = (ax— bi)? + Gx — bo)? +++ + (Qnx — Dn)’ 
= Ax’ —2Bx+ C, 


where A = aj + a3 +---+ a> > 0 (for, at least one of the a;’s is nonzero), B = 
ayby + ayby + +++ + Gnby and C = bt + b5 +--+ +b. 

Since f(x) is a sum of squares, we have f(x) => 0 for every x € R. On the other 
hand, since A > 0, Corollary 6.16 gives A = 4(B* — AC) < 0. Hence, B? < AC, 
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or, which is the same, |B| < A/C. Substituting the values of A, B and C, we get 
Cauchy’s inequality. 

According to the above reasoning, equality in Cauchy’s inequality is equivalent, 
for the function f, to A = 0. In turn, this is equivalent to the existence of a single 
a € R such that f(a) = 0. However, since f(a) is a sum of squares, the only way 
we can have f(a) = 0 is if each one of these squares vanishes, i.e., if 


qa—b,j =aa—by =---=a,a—b, = 0. 


Finally, since at least one of the b;’s is nonzero, we have a ¥ 0 and, writing A = 1, 
we get 


a\ = ib, a2 = Ab, ee dn = Ab, 


as a necessary and sufficient condition for equality. 
In order to continue in our study of function, we need a piece of terminology. 
Definition 6.19 Let J C R be an interval. A function f : J > R is said to be: 


(a) increasing, if f(x) < f(x2), for all x < x in J. 
(b) decreasing, if f(x) > f (x2), for all x; < x, in J. 
(c) nondecreasing, if f(x;) < f(x2), for all x; < x) inJ. 
(d) nonincreasing, if f(x) > f (x2), for all x; < x, in/. 


Moreover, in any of the cases above, we say that the function f is monotonic in /°. 


Regarding the above definition, an interesting (and, as we shall see, important) 
problem is the one of finding the monotonicity intervals of a function f : J > R, 
where / C R is an interval. By that we mean to find the intervals J C J such that f 
is increasing (resp. decreasing, nondecreasing or nonincreasing) in J. Here we shall 
see some elementary examples, postponing a more general analysis to Chapter 9. 


Example 6.20 The affine function f : R — R, given by f(x) = ax+ J, is increasing 
if a > 0 and decreasing if a < 0. 

Let’s verify this claim in the case a > 0, the analysis of the case a < 0 being 
totally analogous. Letting x; < x2 be two real numbers, it follows from a > 0 that 


f (x2) —f(%1) = (ax. + b) — (ax, + Bb) = a(xy — x1) > 0. 


Therefore, f is increasing. 


3In the notations of this definition, it is worth observing that, for some authors, a function f 
satisfying the condition of item (a) (resp. of item (b), (c) or (d)) is said to be strictly increasing 
(resp. strictly decreasing, increasing or decreasing). 
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2 


Example 6.21 The function f : [0, +00) — R, given by f(x) = <5, is increasing 
in the whole interval [0, +00). To check this, take real numbers 0 < a < b. Then, 


2 2 


b+2 a+2 
_ 1 
~ (a+ 2)(b + 2) 


f(b) -f(@ = 


[b?(a + 2) —a’(b + 2)], 


and, since (a + 2)(b + 2) > 0, it suffices to show that b?(a + 2) — a*(b+ 2) > 0. 
To this end, start by writing 
b°(a+2)—a’(b+2) =b’a—a’b + 2(b — a’) 
= ab(b—a) + 2(b—a)(b+a) 
= (b—a)|ab+ 2(b+a)]. 
Now, since 0 < a < b, both factors in the last product above are positive, so that 
b’(a+2)—a*(b+2)>0. 
Example 6.22 Function f : R > R, given by f(x) = x° + 2x, for every x € R, is 
increasing. Indeed, for any real numbers a < b, we have 
fo)-f@ =b? —@ +2b-2a 
= (b—a)(b’ + ba + a’) + 2(b-a) 
= (b—a)(b’ + ab + a’ +2). 
Since b — a > 0, it suffices to show that a? + ab + b? + 2 > 0; in order to do this, 


one possibility is to use the inequality between the arithmetic and geometric means 
for two numbers: 


a +b? +ab+2> 2\ab| + ab+2 > |abl| +2>0, 
where, in the next to last inequality, we used the fact that |w| + a@ > 0, for every 


aeéeR. 


The coming proposition solves, for quadratic functions, the problem of finding 
the monotonicity intervals. 


Proposition 6.23 Let a, b, c € R, with a £ 0, and f(x) = ax* + bx + c for every 
xéER. 


(a) Ifa > 0, then f is decreasing in (—oo, —-£] and increasing in [-£, +00 


= Ne 


eae 


(b) Ifa <0, then f is increasing in (—o0, 7 


| and decreasing in [-4, +00). 
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Proof Let’s do the proof of item (a), the proof of item (b) being totally analogous. 


For x. > x; > -#, we have 


f2) — fer) = a3 — x4) + BG» — x1) 


b 
= a(x2 — x1) (x» +x + -) > 0, 


since x. > x, > -2£ gives x2 — x; > 0, as well as x2 +x; + 7 > 0. oO 
The next definition is, in a certain sense, complementary to Definition 6.19. 


Definition 6.24 Let J Cc R be an interval and f : ] > R be a given function. We 
say that yo € R is the minimum value of f in / if the two following conditions are 
satisfied: 


(a) Im (f) Cc Lyo. +00). 
(b) yo € Im(f). 


In this case, the real numbers x) € J such that f(x9) = yo are called the minimum 
points of the function /. 


Similarly, we define what one means by the maximum value and the maximum 
points of a function f : J > Rd C R being an interval). The maximum and 
minimum points of a given function (provided they exist) are collectively called its 
extreme points; accordingly, the values the function takes at those points are its 
extreme values. 

In Section 9.6, we shall see how to search extreme points for differentiable 
functions, i.e., functions possessing derivatives. For the time being, we shall content 
ourselves in analysing some elementary examples, the first of which being an 
immediate consequence of Proposition 6.15. 


Proposition 6.25 With respect to the quadratic function f(x) = ax* + bx 4+ c, if 


a > 0 (resp. a < 0), then -£ is the only minimum (resp. maximum) point of f. 


Moreover, the minimum (maximum) value of f is —2. 


The proposition above has several interesting applications, two of which are 
collected below, for the sake of illustrating its use. For the necessary geometric 
background, we refer the reader to [4]. 


Example 6.26 There is given a semicircle of lcm of radius. A rectangle is so 
situated that one of its sides lies on the diameter of the semicircle, whereas its other 
two vertices lie on the semicircle itself. Compute the largest possible value for the 
area of the rectangle. 


Solution Let Fig. 6.4 account for the described situation, so that AB is the diameter 

of the semicircle, O is its center and PQRS is the given rectangle, with PQ C AB. 
Setting OO = x and OR = y, the area of PORS equals 2xy. On the other hand, 

applying Pitagoras’ theorem to triangle OQR, we get x” + y” = 1 and, hence, 
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Fig. 6.4 Maximizing the 
area of rectangle PORS 


A P OQ B 


Qxy = 2xV 1 — x? = 2271 — x7) = AV? — x4” 


Now, making the substitution z = x’, it follows from the last expression above for 
the area that it suffices to maximize the quadratic function f(z) = z — z’, subjected 
to the condition 0 < z < 1 (for, x < OR = 1). By Proposition 6.25, such a 
function (without any further restrictions) admits z = 5 as its only maximum point. 
Moreover, since 5 € (0, 1), it follows that the desired maximum value is f (5) = i. 
Therefore, the maximum value for the area is me a oO 


Example 6.27 Given a triangle ABC in the plane, show that its baricenter is the 


only point P in the plane of ABC for which the sum AP + BP + CP attains its 
minimum possible value. 


Proof Fix a cartesian system of coordinates in the plane, with respect to which 
A(x1,¥1), Bt, y2) and C(x3, y3). If P(x, y), then the formula for the distance 
between two points in the plane furnishes 


AP + BP + CP =f(x) +). 


where 


3 
f@= Yi@-xi)? = 3x —2(x, + x2 + x3)x + (x7 +3 4 a) 


i=1 


and, analogously, g(y) = 3y? — 2(1 + y2 + y3)y + (Yt + y5 +.Y3). 

Since x and y are independent variables, in order to minimize AP + BP + 
CP it suffices to minimize the quadratic functions f and g. To this end, invoking 
Proposition 6.25 we conclude that f and g attain their minimum values only at the 
points x = $(x1 + x2 +43) andy = (1 + y2 + y3), respectively. However, it is 
a well known fact (cf. Chapter 6 of [4], for instance) that these are precisely the 
coordinates of the baricenter of triangle ABC. oO 


To address the problem of finding the maximum and/or minimum values of a 
given function, another elementary strategy which is sometimes useful is to resort 
to inequalities. In what follows we shall see two examples along these lines. 
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Example 6.28 Let f : [0,+0o) — R be the function given by f(x) = - for 


every x € R. What is the minimum value of f? Does f attain a maximum value? 


Solution First of all, note that 


eel  =12 


IOS lag = x+1 


ie =(x+1)+ es 
=x- — x —_ - 
x+1 x+1 


Therefore, applying (5.2) (witha = x+ landb = =) we get 


(x + +2 2yeet 1). —— 2V2, 


with equality if and only if x + 1 = a , Le., if and only if x? + 2x— 1 = 0. Since 
x > 0, we conclude that equality takes place in the above inequality if and only if 
x= J/2-1. Thus, for x > 0 we have 


f@®=@+ I+ 5-22 2V3-2, 


so that 2./2 — 2 is the minimum possible value for f, and it is attained only at 
x=J/2-1. 

To what is left, just notice that, for n € N, we have f(n) = n— 1+ — <a >n-l. 
Therefore, f does not attain a maximum value. oO 


Example 6.29 Find the maximum value and the maximum point(s) of the function 


f : [0, +00) > R, given by f(x) = we. 


Solution It follows from the inequality between the arithmetic and geometric 
means that 


eae eee 
XxX — — — — 
a ae: 


so that 
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Equality holds if and only if x* = e which is the same as x = Fz (since x > 0). 


Thus, at is the maximum value of f and it is attained only at x = ee Oo 
Problems: Section 6.2 
1. Find the image of the function f : R > R, given by f(x) = = , for every 
xeER. 
2. * Find the image of the function f : R* — R, given by f(x) = x + 1, for every 
x € R*. 


3. Let J C R be an interval, let a € J and f : J > R bea given function. If f is 
increasing (resp. decreasing) in (—oo, a] N J and decreasing (resp. increasing) 
in [a, +oo) NJ, prove that a is the only maximum (resp. minimum) point of f 
inl. 

4. * Let X C R be a nonempty set, f : X — R be a given function andc ¢€ R. 
Relate the images of the functions f and f + c. More precisely, if Y = Im(f), 
prove that Im (f + c) = Y +c, where Y + c denotes the set 


Y+c={y+oaye ¥}. 
5. * Let X C R be a nonempty set, f : X — R be a given function and c € R*. 
Relate the images of the functions f and cf. More precisely, if Y = Im(f), 
prove that Im (cf) = cY, where cY denotes the set 


cY = {cy; ye Y}. 


6. Motivated by the canonical form of the second degree trinomial ax? + bx + c, 


from now on we shall say that 
2 ~ oR 
XxX — ——— 
2a 4a2 


is the canonical form of the quadratic function f(x) = ax* + bx + c. Use this 
canonical form to give another proof of Proposition 6.15. 

7. Let f(x) = ax? + bx + c be a quadratic function for which A > 0, and x; < x2 
be the roots of f(x) = 0. Prove the following items: 


f(x) =a (6.5) 


(a) Ifa > 0, then f(x) <0 @ x € (1, x2). 
(b) Ifa < 0, then f(x) <0 © x ¢ [x1, xy]. 


8. Let f(x) = ax* + bx + c be a quadratic function. If there exists a real number 
Xo for which af (x9) < 0, prove that A > 0 and xo € (x1, x2), where x; < x2 are 
the roots of the equation f(x) = 0. 
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Among all rectangles of a given perimeter, prove that the one of largest possible 
area is the square. 

The cross section of a tunnel has the shape of a semicircle of radius 5m. The 
tunnel has two lanes of traffic, which go in opposite directions and are separated 
one from the other by a narrow median. The trucks of a transportation firm are 
to cross the tunnel to take goods from one city to another. If the trucks are 18m 
long, what should be their widths and heigths, so that they can carry the largest 
possible load volume per travel? 

Compute the maximum value of the function f : R > R given by f(x) = 
for everyx € R. 

Let a; < a <--- < a, be given reals, and f : R > R be given by 


5x—1 
x2+1? 


F(x) = |x— | + |x — | +--+ + [x -— ey], 


for every x € R. Prove that f attains a minimum value and compute it in terms 
Of G1, 2, ..., Ay. 

In each of the following items, use the inequality between the arithmetic and 
geometric means to compute the maximum value of the given function: 


(a) f: R— R given by f(x) = sa) for every x € R. 


(b) f: R > R given by f(x) = Artie? for every x € R. 
(c) f : [0, 1] > R given by f(x) = x(1 — x°), for every x € [0, 1]. 


In each of the following items, use the inequality between the arithmetic and 
geometric means to compute the minimum value of the given function: 


(a) f : (0, +00) > R given by f(x) = wo for every x € (0, +00). 

(b) f : (0, too) > R given by f(x) = x? + <, for every x € (0, +00), where a 
is a positive real constant. 

(c) f : (0, +00) > R given by f(x) = rer for every x € (0, +00). 


(d) f : (0, +00) > R given by f(x) = 6x + a for every x € (0, +00). 


(Romania) Let x, y € R be such that x? — xy + y? < 2. Show that x* + y* < 8 
and explain when the equality holds. 
(TT) Find all reals x, y, z and ¢t such that 


y=x+2x 
z=y?+2y 
t=2+27 0 
x=fP+2t 


For the next problem, the reader might find it helpful to recall the discussion 
on the gcd of two nonzero integers, at the beginning of Chapter |. alternatively, 
look at Chapter 6 of [5]. 
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17. (OCM) Let f : N > N be a function such that f(mn) = f(m) + f(n) whenever 
m and n are relatively prime. A natural number m is said to be a strangulation 
point of f ifn < m => f(n) < f(m) andn > m => f(n) > f(m). If f has 
infinitely many strangulation points, show that it is an increasing function. 


6.3 Composition of Functions 


Functions f : X —> Y and g: Y — Z give well defined rules for, departing from 
x € X via f, get y = f(x) € Y and then, via g, get z = g(y) € Z. It thus seems 
reasonable that we may form a new function that allows us to go directly from X 
to Z. This is indeed the case, and the corresponding function is called the composite 
of f and g, according to the following 


Definition 6.30 Given functions f : X — Y and g : Y — Z, the composite 
function of f and g (in this order) is the function g of : X — Z, defined for 
x € X by 


(g of)(%) = g(f(@)). 


Roughly speaking, the above definition means that, in order to find the image of 
x € X by gof, it suffices to find the image of f(x) € Y by g. On the other hand, 
it is easy to verify that g o f, as defined above, is indeed a function. Also, observe 
that to form the composite of f and g, it is necessary that the domain of g equals the 
codomain of f. Let’s see some examples. 


Example 6.31 Let X and Y be nonempty sets and f : X — Y an arbitrary function. 
If Idy : X¥ — X and Idy : Y > Y are the identity functions on X and Y, respectively, 
then 


foldy=f and Idyof =f. 


Let’s check the equality f o Idy = f, the other one being totally analogous. To this 
end, it suffices to note that fo Idx is a function from X to Y such that, for evey x € X, 


(fo Idx) (x) = fUdx(x)) = f@). 


Example 6.32 Let f,g : R > R be the functions given by f(x) = x” and g(x) = 


—- for every x € R. Then, g of andf o g are functions from R to R, with 


1 1 1 


(g of)(x) = gfx) = (Gynt @eai 2a 
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and 


2 
1 
(eo) — = 2 — =S 

Fon) =F) = 6) =(S) =a 

The preceding example show an interesting phenomenon: it must happen that 
gof #f og. More precisely, it may happen that we can form g of but cannot form 
f °g (or vice-versa); it suffices to have, for instance, f : X > Yandg: Y > Z, with 
X # Z. Nevertheless, even if we can form both g of and /f o g, it may well be the 


case that gof #fog. 
Example 6.33 Let f,g : (0, +oo) > (0, +00) be functions given by 


r+ x+2 


FG) =F and (fog)@) =. 


for every x € (0, +00). Find an expression for g(x) in terms of x. 


Solution The definition of composite function gives 


X + 
3 


g(x)? + 1 


2 
= (fo g)(x) = f(g) = “BeGe 


2 . . 
t say st = © or, which is the same, 
g(x) 3 


so tha 
3g(x)? + 3 = 3x + 2)g(x). 


Looking at this expression as a first degree equation in g(x)”, we obtain g(x)? = 


1 at |: . eae 
zur and, hence, g(x) = TF for each x > 0. However, since g has positive 
image, we must have g(x) = west for every x > 0. Oo 


The operation of composition of functions, albeit not commutative, is associative, 
as the next proposition teaches us. 


Proposition 6.34 Given functions f :X > Y, g: Y > Zandh: Z > W, we have 


ho(gof)=(hog)of. 


Proof First of all, both h o (g of) and (ho g) of are functions from X to W. Hence, 
we only need to check that they associate, to each x € A, a single element of W. To 
see this, just note that 


(ho (gof))(x) = h((g of)()) = h((gFa))) 
= (ho g)(F(x)) = (hog) of)(x). 
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The previous proposition is quite important, for, it guarantees that if functions f, 
g and h (in this order) can be composed, then the composite function can be safely 
denoted by ho go f, and we need not worry about which composition to perform 
first. As an immediate generalization of this remark, suppose functions f, g, h and / 
(in this order) can be composed. Then, the previous proposition gives 


lo(ho(gof)) = (Loh) o(gof) =(Wloh)og)of =, 


so that the order in which we insert parentheses does not alter the composition. 

It is not difficult to see that the situation described in the last paragraph remains 
true for the composition (whenever possible) of any finite number of functions. In 
particular, given a nonempty set X, a function f : X — X and a natural number 
n, this last fact allows us to unambiguously define the n—th composite function 
f™ :X > X, by setting 


fi =fofo--of. 


n 


Example 6.35 Let f : R \ {-1, 1} > R \ {-1, 1} be the function given by f(x) = 
a. for every x # +1. For each n € N, find the expression that defines the n—th 
composite function f. 


Solution Firstly, note that f : IR \ {-1, 1} > R \ {-1, 1}. Now, since 


b7@).. 
l+f@ 14+ 


rx 


FOR) = (Ff of@) =fF@)) = 


we have f?) = Idx, the identity function of X = R \ {—1, 1}. This gives us 


fO =fof% =foldy =f and fO =fof =fof = Idy. 


In general, suppose (by induction hypothesis) that we have already proved that 
pee =f and f?4) = Idx, for some integer k > 1. Then, 


pee = fof =fo Idy =f 
and 
pert) = fo fk) =fof = Idy. 


Therefore, we conclude that f) = f whenever n is odd, and f = Idy whenever n 
is even. Oo 


Given a function f : X — Y, we’ve already seen some examples that illustrate 
the fact that the image of f is not necessarily equal to its codomain Y. On the other 
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hand, we can also have two distinct elements of X with the same image via f. For 
an example, consider the quadratic function f(x) = x’, with x € R; for every x € R, 
we have f(x) = x” = (—x)? = f(—x). We attach special names to functions whose 
images coincide with their codomains, or which associate distinct images to distinct 
elements of their domains. This is set in the coming 


Definition 6.36 A function f : X — Y is said to be: 


(a) Injective, or one-to-one, or an injection, if, for every y € Y, there exists at 
most one x € X such that f(x) = y. 

(b) Surjective, or onto, or a surjection, if, for every y € Y, there exists at least one 
x € X such that y = f(x). In other words, this is the same as saying that the 
image of f is all of Y. 

(c) Bijective, or one-to-one onto, or a bijection, if it is simultaneously injective 
and surjective. 


An efficient way of verifying whether a function f : X — Y is injective or not is 
to verify whether the implication 


fa) =f02) > 1 =x (6.6) 


is true for all x,, x. € X. Accordingly, in order to prove that f is surjective, one must 
be capable of, for each y € Y, guarantee the existence of at least one solution x € X 
for the equation f(x) = y. Let’s see some examples. 


Example 6.37 If X C Ris a nonempty set and f : X — X is a function such that 
S((@)) = x for every x € X, then f is a bijection. 


Proof Let x; and x2 be elements of X for which f(x1) = f(x2). According to (6.6), 
in order to show that f is injective it suffices to prove that x; = x. To this end, 
observe that f(x;) = f(x2) implies f(f(x1)) = f(f(%2)) and, then (by using the given 
hypothesis), x; = x2. 

The surjectivity of f also follows from the hypothesis: for a fixed y € X, taking 
x = f(y) € X we get f(x) = f(f()) = y, so that y € Im(f). Oo 


As a particular case of the previous example, the function of inverse proportion- 
ality (cf. Example 6.13) is a bijection from R \ {0} into itself. 


Example 6.38 Let f : [0,1] — [0,1] be a surjective function, such that |f(x)) — 
FS (%2)| < |x1 — x2| for all x1,.x2 € [0, 1]. Prove that there are only two possibilities: 
either f(x) = x for every x € [0, 1], or f(x) = 1 — x for every x € [0, 1]. 

Proof Let a,b € [0, 1] be chosen in such a way that f(a) = 0 and f(b) = 1 (that it 
is possible to make such a choice follows from the surjectivity of f). Then, the given 
hypothesis allows us to write 


1 = |1—0| = |f(6) -f(@| < |b—a| <1, 
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so that |b — a] = 1. However, the only a, b € [0, 1] such that |b — a| = 1 area = 0 
and b = 1, or vice-versa. Suppose that a = 0 and b = 1 (the other case can be 
haldled similarly), and take any c € (0, 1). Triangle inequality (5.10), together with 
the hypothesis on f, give 


1 = |f() —fO)| 
= FQ) —fO| + Fe) —FO)| 
< |1—¢|+ |e—0| 
=(l-—c)+c=1. 
Hence, we must have |f(c) —f(0)| = |c—0| and, since c,f(c) > 0, we get f(c) = c. 
Finally, this is true for every c € [0, 1], so that f(x) = x for every x € [0, 1]. Oo 


The coming proposition teaches us the way injective, surjective and bijective 
functions behave with respect to composition. 


Proposition 6.39 Let f :X — Y and g: Y — Z be given functions. Then: 


(a) gof injective => f injective, but the converse is not necessarily true. 
(b) gof surjective > g surjective, but the converse is not necessarily true. 
(c) g and f injective > g of injective. 

(d) gandf surjective > g of surjective. 

(e) gandf bijective > gof bijective. 


Proof 


(a) For x1,x2. € X, we have 


F(x) = f@2) = gf) = sf) 

> (gof)(x1) = (eo f)Qs) 

=> x1 = X2, 
where, in the last passage, we used the fact that g o f is injective. We now have 
to give an example in which f is injective but g o f is not. To this end, take 
X=Y=Z=R,f(x) =x and g(x) = x. 
Choosing z € Z arbitrarily, the surjectivity of g o f guarantees the existence of 
at least one x € X such that z = (go f)(x). Then, z = g(f(x)), so that g is also 
surjective. For the second part, take X = Y = Z = R, g(x) = x and f(x) = x’; 
then, g is surjective, but g of is not. 
(c) Let x), x2 € X. By using the injectivity of g and, then, that of f, we obtain 


(g of)(x1) = (g of) (x2) => gf) = 8(FO2)) 
= fa) = fQ@2) 


>X, =X, 


(b 


wm 


Hence, g of is also injective. 


168 6 The Concept of Function 


(d) Let z € Z be arbitrarily choosen. The surjectivity of g guarantees the existence 
of y € Y such that z = g(y). On the other hand, the surjectivity of f assures the 
existence of x € X for which f(x) = y. Then, we have 


(gof\(x) = gf@) = gO) =z 


so that g o f is surjective, too. 
(e) It follows from items (c) and (d) that 
g and f bijective = g and f injective and surjective 
= gof injective and surjective 


= gof bijective. 


Let’s revisit Example 6.37 in light of the previous proposition. 


Example 6.40 Let X be a nonempty set. If f : X — X is a function such that 
f of = Idx, then f is a bijection. 


Proof Indeed, since the identity function Idy : X — X is a bijection, it follows from 
items (a) and (b) of the previous proposition that f is injective and surjective, hence, 
bijective. B| 


The material of this section also allows us to study the important concept of 
countably infinite sets, according to the following 


Definition 6.41 An infinite set A is said to be countable if there exists an injective 
function f : A > N. In this case, we shall also say that A is a countably infinite set. 


If A is countably infinite and B C A is infinite, then B is also countably infinite. 
Indeed, by composing an injective function f : A > N with the inclusion: B>A 
(that sends each x € B to itself), we get the injective function fo.: B—> N. In 
particular, every infinite subset of N is countable. 

The set Z of integers is also countably infinite, for one can easily check that the 
function f : Z — N given by 


—2x, ifx <0 


FON aay, Hea st) 


is a bijection. Perhaps a little more surprising is the following 
Example 6.42 The cartesian product N x N is a countably infinite set. 


Proof It suffices to define f : N x N > N by setting 


f(m,n) = 2""'(2n—1), Vm,neéN. 
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A straightforward argument on odd and even numbers shows that f is injective. 
(Actually, the Fundamental Theorem of Arithmetic — cf. introduction to Chapter | 
or Chapter 6 of [5] — assures that f is bijective.) Oo 


With a little more effort (see Problem 21), we can prove that Q is countably infi- 
nite. However, as will be seen in Section 7.4 (cf. Example 7.46), R is uncountable, 
i.e., not countable. In turn, this implies (cf. Problem 22) that I = R \ Q is also 
uncountable. For yet another example of an uncountable set, see Problem 23. 

The following lemma shows that the elements of every countably infinite set can 
be written as terms of a sequence. 


Lemma 6.43 [fA is a countably infinite set, then there exists a bijection f :N > A. 
In particular, letting a, = f(n), we get A = {a1, a2, a3,...}. 


Proof By definition, there exists an injective function g : A > N. Then, g induces 
a bijection (which we will also denote by g) from A to B = Im(g) C N. If we 
construct a bijection h : N > B, then f = goh: N > A will also be a bijection. 

To what is left to do, we start by letting b} = min B and setting h(1) = by. Since 
Bis infinite, we can let b) = min(B \ {b;}) and set h(2) = bp. By the same token, let 
b3 = min(B \ {b1, b2}) and set (3) = b3. Continuing this way, we define a function 
h: N= B such that, letting h(k) = b;, we have b} = min B and 


by = min (B \ {by,..., be-1}) 


for every natural k > 1. In particular, b} < by < b3 < --- and his injective. 

If h was not surjective, there would exist b € B such that b # by, bo, b3,.... If 
by < b for every k € N, then B Cc {1,2,...,D}, a contradiction to the fact that B 
is infinite. Hence, there would exist a natural m > 1 such that by_; < b < Dy. 
However, since 


h(m) = min (B \ {h(),...,a(m — 1)}) 
= min (B \ {b1,... Ont}. 
we should have defined h(m) to be b, instead of b,,. Since this is a contradiction, we 


conclude that h is indeed surjective. oO 


The concept and properties of countable sets give rise to interesting results, one 
of which we collect in the following 


Example 6.44 Show that it is possible to partition the set of natural numbers in two 
sets A and B satisfying the following conditions: 


(a) Neither A nor B contains the terms of an infinite and nonconstant AP. 
(b) For all distinct x, y € A, we have |x — y| > 2016. 


Proof An infinite and nonconstant AP of naturals is characterized by the ordered 
pair (a,r), where a is its first term and r is its common difference. Since N x N 
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is countable, the same is true of the family P of infinite and nonconstant AP’s of 
naturals. Hence, we can write P = {51, 52, 53,...}. 

Now, let A; be the set whose elements are the terms of s;, and let A be defined in 
the following way: take x; € Aj, x2 € Az, ...such that x2 — x, > 2016, x3 — x2 = 
2017, x4 — x3 => 2018, ... (that this is possible follows from the fact that each A, is 
an infinite set). Then, let B= N \ A. 

It is clear that A does not contain the terms of an infinite AP of common 
difference r € N, for the differences x; — xj; are eventually all greater than r. 
On the other hand, B does not contain the terms of an infinite and nonconstant AP 
either, for, if it did, we should have B D A, for some k € N; however, x, € A, \B.O 


Problems: Section 6.3 


1. Let f and g be real functions of a real variable, given by f(x) = x — ; and 
g(x) = x2? - ‘ Find the solution set of the inequality |(g of)(x)| > (g of)(x). 

2. Let f and g be real functions of a real variable, such that f(x) = 2x + 7 and 
(f 0 g)(x) = x? — 2x + 3, for every x € R. Find the expression that defines g(x) 
in terms of x. 

3. Let f,g : R > R be such that g(x) = 2x —3 and (fo g)(x) = 2x7 — 4x +1. 
Find the expression that gives f(x) in terms of x. 

4. Let f and g be real functions of a real variable, given by f(x) = ax + b and 
g(x) = cx + d for every x € R, with ac 4 0. Show that 


fog=sgof @ (a—1)d=(c—1)b. 


5. Letf : IR — R be the function defined by 


_ § #4, itx 4p 
SE dey ap 


If f(f(x)) = x for every real x, compute the possible values of b. 

6. * Let 7 C R be an interval and f : J — R be an increasing or decreasing 
function. Prove that f is injective. 

7. Let I,J C R be intervals and f : J — J and g : J > R be given functions. If f 
and g are increasing (resp. if f is increasing and g is decreasing, or vice-versa), 
prove that g o f is also increasing (resp. decreasing). 

8. Let f : R \ {0} — R be a function such that 


root (=) =1 and r(x+2) =r) +1(=). 


for every real x 4 0. If wand v are nonzero real numbers such that u? + v? = 1 
and f (+) = 2, compute the value of f (+). 


6.3 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
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. Given a function f : X — Y, we define its graph* as the subset G, of the 


cartesian product X x Y given by 


Gy ={@y) EXXY; y=fQ@). 


Let F : X — G; be the function defined by F(x) = (x, f(x)), for every x € X. 
Prove that F is a bijection. 

* Let @ # X C R bea union of intervals, which is symmetric with respect to 
0 € R. We say that a function f : X — R is even (resp. odd) if f(x) = f(—x) 
(resp. f(x) = —f(—x)), for every x € X. If @ 4 X C Ris as above, prove that 
every function f : X — R can be written, in a unique way, as a sum of an even 
and an odd function with domain X. 

Let f : R \ {0} > R be a function such that f (¢) = f(a) — f(b), for every 
nonzero real numbers a and b. Prove that f is an even function. 

Let f : R — R be an odd function. Decide whether the function f 0 f is even, 
odd, or not even nor odd. 

Let g : R > R be an odd function, such that g(x) > 0 whenever x > 0. Show 
that there exists a function f : R > R for which g = f of. 

Find all real values of & for which the image of the function f : R \ {-1} > R 
given by f(x) = ae thet equals R \ (—L, L), for some positive real L. 
* A function f : R > R is periodic if there exists a smallest positive real 
number p, called the period of f, such that f(x + p) = f(x) for every x € R. 
Given a periodic function f : R — R, of period p > 0, do the following items: 


(a) Let g : R — R be also periodic of period p. If f(x) = g(x) for every 
x € [0,p), prove that f = g. 

(b) Given a € R \ {0}, prove that the function g : R — R, given by g(x) = 
f (ax), is periodic of period tr 


(Italy) Let f : R — R be a function such that f(10 + x) = f(10 — x) and 
f(20 + x) = —f(20 — x), for every real x. Prove that f is odd and find p > 0 
such that f(x + p) = f(x), for every x € R. 

(IMO - adapted) Let f : R — [0, 1] be a function such that, for a certain a € R, 
we have 


1 
fe +a)= 5+ V7@) FOP, 


for every x € R. Prove that f is periodic. 
(Brazil) The function f : Z — R is such that f(x) = x — 10 for x > 100 and 
f(x) =f(@ + 11)) for x < 100. Find the image of f. 


4Graphs of real functions of a real variable will be one of the main objects of study along these 
notes, starting from Section 6.6. 
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(Hungary) Let f : N > N be a function satisfying the following conditions: 


(a) f() =1. 
(b) f(2n) = 2f(n) +1. 
(c) ff) = 4n +1. 


Compute f(1993). 

Give an example of a surjective function f : N — N such that, for every n € N, 
the set {x € N; f(x) = n} is infinite. 

* The purpose of this problem is to show that Q is countably infinite. To this 
end, do the following items: 


(a) If sets A and B are either finite or countably infinite sets, show that A x B is 
either finite or countably infinite. 

(b) Construct a surjective function from Z x N into Q. 

(c) Conclude that there exists a surjection f : N — Q and, then, that Q is 
countable. 


* The purpose of this problem is to show that I = R \ Q is uncountable, 
assuming that R itself is uncountable (this will be proved in Example 7.46). 
To this end, do the following items: 


(a) Let A}, A>, A3,... be a countably infinite collection of sets. Show that there 
exist B, C A, such that B), Bo, B3,... are pairwise disjoint and Urs Ay = 


ens Bx. 
(b) In the notations of (a), if each of A,, Az, A3, ... is finite or countably infinite, 


show that (_),., Ax is also finite or countably infinite. 
(c) Show that I is uncountable. 


* Let F be the family? of infinite subsets A of N such that N \ A is also infinite, 
1.€., 


F ={ACN;A and N\A are infinite.}. 


Show that F is an uncountable set. 


6.4 Inversion of Functions 


Among all functions f : X — Y, the case of a bijection is, in a certain sense, the 
best possible. Indeed, in this case the elements of X and Y are in a one-to-one onto 
correspondence, so that to each element of X there corresponds a single element of 
Y via f, and vice-versa. When this happens, we can form a function g : Y > X by 
asking that 


f(x) =y @ gy) =x. 


5A family is a set whose elements are sets themselves. 
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At this point, a natural question is this: why cannot we use the same declaration 
above to define g when f is not bijective? From an intuitive point of view, if f is not 
surjective, then there exists an element y of Y which is not the image of any element 
of X via f; therefore, there is no natural way of using f and y to define g(y). On the 
other hand, if f is not injective, then there exist distinct elements x;,x2. € X with the 
same image y € Y via; if we were to try to define g by using f, there would also 
be no natural way of deciding which of x; and x2 should be equal to g(y). 

Back to the case in which f is bijective, it is not difficult to see that g, defined 
as above, is indeed a function, moreover such that (g o f)(x) = x, for every x € X, 
and (f o g)(y) = y, for every y € Y. Yet in another way, we have go f = Idy 
and f o g = Idy. Conversely, if f : X — Y and g : Y — X are functions such 
that go f = Idy and fo g = Idy, then Proposition 6.39 guarantees that f must 
be a bijection, and Problem | assures that g is the only function satisfying these 
compositions. 

We summarize the above discussion in the following 


Definition 6.45 Let f : X — Y bea given bijection. The inverse function of f is 
the function g : Y > X such that, for every y € Y, 


gy) =xey=f). 


From now on, whenever there is no danger of confusion, we shall denote the 
inverse (function) of a bijection f : X > Y by f—! : ¥ + X. Notice that the exponent 
—1 in the notation of inverse function has no arithmetic meaning; it simply calls the 
reader’s attention to the fact that f—! does the opposite of f, i.c., applies Y into X 
instead of X into Y, and reverses the arrows of the correspondences made by f. 

Now, it naturally emerges the question of how to effectively compute the inverse 
of a given bijection. Although such a computation is generally more complicated 
than that of composite functions, in the case of real functions of a real variable f : 
X — Y we can reason in the following simple way: since f—!(y) = x & f(x) = y, 
in order to find f—!(y) = x it suffices to solve, for x € X, the equation f(x) = y. 
If we find a single solution x € X for every y € Y, then f will be a bijection and 
x (computed in terms of y) will give the sought for expression for f—!(y). Let’s 
implement such a reasoning in some relevant examples. 


Example 6.46 Let a and b be given reals, with a 0, and let f : R > R be the 
affine function given by f(x) = ax + b, for every x € R. Show that f is a bijection 
and find the expression of f~!. 


Proof Initially, note that 


—b 
f® =yeartba=yexua=V. 
a 
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Hence, f is a bijection and, by the definition of f~', this value of x is precisely equal 
to f—!(y), so that 


P=. 


a oO 


Example 6.47 If X is a nonempty set, then, analogously to the former example, the 
inverse function of the identity function Idy is this function itself, so that we can 
write (Idy)~! = Idy. Nevertheless, the inverse of a bijective function can be the 
function itself, even if it is not the identity function of some nonempty set; an 
example is furnished by the inverse proportionality function f : R \ {0} > R \ {0} 
(which we already know to be bijective), for which f(x) = 1, for every x € R \ {0}. 
Indeed, since : 


1 
tS a ia ad , 
we conclude that 
- 1 
fo) =x= - 
y 
and, hence, f—! = f. 


Example 6.48 Let a, b and c be given real numbers, with a > 0. Propositions 6.15 
and 6.23 guarantee that the quadratic function f(x) = ax?-+bx-+c, seen as a function 


— b A 
F:[-gpte0) > [-g.t0). 


is a bijection. Compute the expression of its inverse. 


Solution According with the discussion that preceded Example 6.46, in order 
to get the expressio of f-! : [-2, +00) > [-£, +00), it suffices to fix 
y € [—4,+00) and solve, for x € [—4,+00), the equation f(x) = y, ie., 
ax? + bx + c—y = 0. Since this is a second degree equation in y, the condition 
x= -£ gives (remember that a > 0) 


—b+ Vb? —4a(c—y) _ —b+ JA + 4ac 


2a 2a 


where A = b* — 4ac is the discriminant of f. Therefore, 


—b+ JK + 4ay 


—] = 
fo) = 
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Example 6.49 As a particular case of the previous example®, the function f : 
[(0, +00) + [0, +00), given by f(x) = x? for every nonnegative real x, is a bijection 
having as its inverse the square root function 


f-! : [0, +00) —> [0, +00) 
x > fx - 


We finish this section obtaining a useful formula relating composition and 
inversion of bijections. 


Proposition 6.50 [ff : X — Y and g : Y > Z are bijections, then go f : X — Z is 
also a bijection and 


(gofyt=flogt, 


Proof We already know, from item (e) of Proposition 6.39, that gof is bijective. On 
the other hand, since (g0f)~! and f—! o g~! are both functions from Z to X, in order 
to check that (go f)~! = f—! 0 g7! it suffices, by the uniqueness of the inverse (cf. 
Problem 1), to show that 


(flog ')o(gof) = Idy and (gof)o(f-! og!) = Idz. 


These verifications are quite straightforward and will be left for the reader as 
exercises. Oo 


Problems: Section 6.4 


1. * Let f : X — Y bea given function. 


(a) If g : Y — X is such that go f = Idy and fo g = Idy, prove that f is a 


bijection. 
(b) Prove that there exists at most one function g : Y — X for which gof = Idy 
and f o g = Idy. 


2. * Complete the proof of Proposition 6.50, showing (in the notations of its 
statement) that 


(ft og')o(gof) = Idx e (gof)o(f' og”) = Idz. 


Here and in the previous example, we are relying on previous knowledge from the reader, namely, 
the existence of square roots of positive real numbers. Therefore, the present discussion will be 
rigorously justified only after Theorem 7.9, where we prove the existence of such roots. In this 
respect, the reader might also want to take a look at Example 8.36. 
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7. 


10. 
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Let f : [5. +oo) > [3. +00) be the function given by f(x) = x7 — x + 1, for 
every x > 5: Show that f is a bijection and obtain the defining expression of its 
inverse. 


. Letf : R \ {2} > R \ {3} be the function given by f(x) = x. Show that f is 


a bijection and obtain the defining expression of its inverse. 


. Let a,b,c,d € R* andf : R\ {-4 — R be the function given by f(x) = 


a 5 Generalize the previous problem, showing that f defines a bijection from 


R \ {-4 onto R \ {4}. Moreover, conclude that, if d = —a, then f~! = f. 


. * Let n be a natural number and f : [0,+0co) — [0,+00) be the function 


given by f(x) = x”. Admitting that Im(f) = [0, +00) (a fact to be proved in 
Example 8.36), show that f is an increasing bijection and get the expression for 
f_', thus defining the n-th root function. 

Give an example of a bijection f : R — R for which both f + f—! and f — f7! 
are also bijections. 


. * LetI,J C R be intervals and f : J > J be a bijection. If f is increasing (resp. 


decreasing), prove that f—! : J + J is also increasing (resp. decreasing). 


. Let f : R > R be a bijection and g : R x R > R x R be the function defined 


by g(x, y) = («°, x —f(y)). Prove that g is bijective and find the expression that 
defines its inverse in terms of f—!. 

(IMO) Let G be a (nonempty) set of affine functions possessing the folllowing 
properties: 


(a) Iff,g €G, thenfogeG. 
(b) Iff € G, thenf—! € G. 
(c) For every f € G, there exists xp € IR such that f(x¢) = xp. 


Prove that there exists a real number xo such that f(xo) = xo for every f € G. 
(France) Let f : N — N be a bijection. Prove that there exist natural numbers 
a<b<csuch that f(a) + f(c) = 2f(d). 


6.5 Defining Functions Implicitly 


A function can be defined implicitly by a set of properties. For example, letting 
g(x) = x + 1 and h(x) = x — 1 for every x € R, the function f : R — R given by 
f(x) = x’ satisfies 


1.€., 


f(gQ) = g(a) and f(h(x)) = AQ), 


it is such that 


fet) =@41% =x 42x41 and f@4+1 =@-1% = 2-241; 


therefore, for every x € R, we have 


f«+tD)—-f@—-1) = 4. 
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Now, we can try to reverse the steps above, asking which functions f : R > R are 
such that 


fxtD—fxe—-D) =4x, VxER. (6.7) 


Clearly, f(x) = x? is not the only one, for, f.(x) = x?+c also satisfies (6.7), whatever 
the real constant c. 

Now, let f : R — R be any real function satisfying (6.7), so that f is not explicitly 
given by its values. Since we merely know that f must satisfy that relation, we say 
that f is implicitly defined. 

On the other hand, since we know that (6.7) must be satisfied, it allows us to find 
other relations f must also satisfy. For instance, if g : R > R is given by g(x) = x’, 
then 


f(g) + D) —f(g@) — 1) = 4g) 
or, which is the same, 
fi? +1)-f(?-1) = 4°, VxeR. (6.8) 


Thus, any function f : R — R satisfying (6.7) will also satisfy (6.8). Nevertheless, 
relation (6.8) may not be very useful to the task of finding all functions f : R > R 
that satisfy (6.7). 

It is our purpose in this section to approach the problem of finding all functions 
implicitly defined by a certain (finite) set of relations. Since there is no general 
theory to accomplish this, we shall content ourselves in analysing a number of 
interesting examples. In turn, these examples will provide us with several useful 
techniques to be employed in such problems. 

Let’s start by looking at two simple examples. 


Example 6.51 (Canada) Find all increasing functions f : N > N, such that f(2) = 
2 and f(mn) = f(m)f(n), for every m,n € N. 


Solution From 1 < f(1) < f(2) = 2 we get f(1) = 1; also, f(4) = f(2)f(2) = 4 
and f(8) = f(4)f(2) = 8. Suppose, by induction hypothesis, that f(2") = 2* for a 
certain natural number k. Then, 


FAM) = fQ2F(Q) = 2-2 = 21, 


and it follows that f(2”) = 2” for every nonnegative integer n. 
Now, let 7 be a fixed natural number. Since f is increasing, we have 


Qn = f(2”) < f(2” + 1) ears ago _ 1) “for — grt, 
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However, f(2” + 1), f(2” + 2), ...,(2”*! — 1) are natural numbers too, so that the 
only possibility is 


OAD) S24 1 74 ya 2 SD, FO SH) SP = 1, 


Finally, since such a reasoning is valid for whatever natural n, we conclude that 
f(m) = m, for every m € N. Oo 


Example 6.52 (OIM) If D = R— {-1,0, 1}, find all functions f : D > R such 
that, for every x € D, we have 


rot (= *) = 64x. 


Solution First of all, note that, since x 4 0, we have f(x)? ( i 1F) # 0 for every 
x € D. In particular, f(x) 4 0 for every x € D. Now, let g(x) = ; at for x € D. The 
definition of D easily gives that g(D) C D, so that we can compose f and g. This 


way, for every x € D, we have 


g(x) 


reece (2 


) = 649(x). (6.9) 


Substituting the defining formula for g in the above relation, we arrive at the equality 


or, which is the same, 


bax? = 64 1l-x 
#(7) 10 = (=). 


for every x € D. Squaring both sides of the relation given in the statement of the 
problem and dividing the result by the last relation above, we find 


3 2 1-x 
Fx) = 64x (=). 


so that f(x) = 43/x2 (=). 

Up to this point, we have only shown that, if there exists a function f : D—> R 
satisfying the stated relation, then it must be given by the last expression above. 
Hence, it is necessary that we verify that such an f does satisfy that relation, for 
every x € D. Since such a verification is straightforward, we leave it to the reader. 

oO 
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Yet in respect to the previous example, a little more practice would have allowed 
us to get rid of formally defining g to, then, compose it with f in order to get (6.9). 
Instead, we could have just said 


Xx 
Substituti. b 
ubstituting x OT as 


in the given relation, we obtain ..., 


keeping in mind that this substitution is actually a composition of functions. From 
now on, whenever there is no danger of confusion, we shall adopt this language 
shortcut, which already appears in the next example. When reading it, try to identify 
the compositions that correspond to the employed substitutions. 


Example 6.53 (Poland) Find all functions f : R — R such that, for every x,y € R, 
we have 


(x— yf +y)—@+ yf —y) = 4xy0? —y’). 


Solution Since this relation is valid for every x, y € R, it must be valid if we make 


ath and y = om with a,b € R. Substituting these values for x and y, we get 
the relation 


x= 


bf (a) — af (b) = (a’ — bab, 


which must be satisfied for every a,b € R. In particular, when ab ¥ 0, dividing 
both sides of the last relation above by ab we get 

b 
f@ _ _ =a —D’, 


a 


for all a,b € R \ {0}. Hence, letting g : R \ {0} — R be the function given by 
g(x) = i —x?, it follows from the above that g(a) = g(b), for all a,b € R\ {0}. In 
other words, g must be constant, so that there must exist a real number k such that 
g(x) = k, for every x € R \ {0}. But this is the same of saying that f(x) = x° + kx, 
for every x € R \ {0}. 

On the other hand, making x = y = 1 in the relation given in the statement of 
the problem, we get f(0) = 0, for every function f : R — R that satisfies those 
conditions. Since 0? + k-0 = 0, we conclude that every such function must be of 
the form f(x) = x° + kx, for every x € R and some real constant k. 

As in the previous example, we have to verify that every such function does 
satisfy the stated conditions. This amounts to straightforward algebrism, which will 
be left to the reader. Oo 


For the next example, we need the following 


Definition 6.54 Let X be a nonempty set and f : X — X be a given function. An 
element xo € X is said to be a fixed point of f if f(xo) = Xo. 
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If 7 C Ris an interval, then a decreasing function f : J + J admits at most one 
fixed point. Indeed, if x;, x2. € J were fixed points of f, with x; < x2, it would follow 
from the fact that f is decreasing that 


x1 = f(%1) > f@2) = x2, 


a contradiction to x} < Xx. 


Example 6.55 (Argentina) Let f : R — R be a decreasing function such that 
f(at+f(x)) = x + f(x), for every real number x. Prove that f(f(x)) = x, for every 
real x. 


Proof The hypotheses on f guarantee that x + f(x) is a fixed point of f, for every 
x € R. On the other hand, according to the previous discussion, the decreasing 
character of f assures the existence of at most one fixed point for it. Therefore, there 
must exist a € R such that x + f(x) = a, for every x € R, so that f(x) = a — x for 
every x € R. Hence, 


F(f@)) =fl@—x) =a-(a—-x) =x, 


for every x € R. Oo 


The next example develops a body of ideas which will reveal themselves to be 
useful in a number of other situations involving implicitly defined functions. In 
particular, the first part of the presented reasoning solves Problem 12, page 152. 


Example 6.56 Find all functions f : R — R such that f(1) = 1 and, for all 
xyeER: 


(a) fe + y) =f) +f). 
(b) fay) = f@FO). 


Solution Let f be a function satisfying the stated conditions. Making x = y = 0 in 
(a), we get 


F(0) = f(0 + 0) = f(0) + FO) = 2f(0), 
so that f(0) = 0. Making y = x in (a), we obtain 

f(2x) =f@+ x) =f) +f) = 2f@), 
for every x € R. Now, making y = 2x in (a), it comes that 


F(3x) = f(x + 2x) = f(x) + f(2x) = f(x) + 2f() = 3f@), 
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for every x € R. Repeating the above reasoning we easily conclude, by induction 
onn €N, that 


f(nx) = nf(x), VneN,xeR (6.10) 


In particular, making x = 1 in (6.10), we get f(n) = n, for every n € N. Letting 
x = + in (6.10), it follows that 


1 1 
1=say=s(n-*) = nr (=) 
n n 


and, hence, f (4+) = +. Finally, x = + in (6.10), with m € N, furnishes 


n 1 1 1 n 
r(=) =f(n-—)=nf(—)=n--=-. 
m m m mom 
In order to see what happens with negative rationals, let y = —x in item (a), to get 
0=f(0) =f + (-x)) =f@) +f»), 
or, which is the same, 
f(x) = -f(-x), VxeER. (6.11) 
In particular, if x < 0 is rational, it follows from (6.11), together with the fact that 
—x is a positive rational, that f(x) = —f(—x) = —(—x) = x. Therefore, f(x) = x, 
for every x € Q. 
Since f(x) = x for every x € Q, we suspect that the identity function of R is the 
only one satisfying the stated conditions. To confirm that, we now turn our attention 


to item (b). First of all, let’s show that, if f(x) = 0 for some x € R, then x = 0. 
Indeed, if x ~ 0, then, making y = + in item (b), we would have 


o=foor (>) =r(+-2) =s0) = 1. 


which is a contradiction. Now, letting y = x 4 0 in (b), we obtain 


f@*) =f(x-x) =f) -f@) = f(x) > 0; (6.12) 


hence, if x, y € R, with x < y, anda ¥ Ois such that y— x = a’, then, successively 
applying (a), (6.11) and (6.12), we get 


fQ) —fO) =f) +f(—) =fo-» =f(@) =f@ > 0, 


so that f is an increasing function. 
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Finally, suppose that there exists a € R such that f(a) < a. We invoke the 
result of Problem 4, page 206, according to which there exists a rational number 
between any two real numbers. This allows us to choose a rational number r such 
that f(a) < r < a, and the increasing character of f gives 


r=fr) <f@, 


which is a contradiction. Analogously, we cannot have f(a) > a, so that the only 
possibility is f(a) = a. However, since a € R was arbitrarily chosen, we conclude 
that f(x) = x for every x € R. Oo 


Our last example shows that, for implicitly defined functions f : N > N, 
elementary divisibility arguments are sometimes useful. 


Example 6.57 (Lituania) Find all functions f : N — N such that, for all natural 


numbers m and n, we have 


f(f(m) + fin) = m+n. 


Solution Let’s first prove that f is injective. To this end, let m and n be natural 
numbers such that f(m) = f(n) = k. Then, f(2k) = f(f(n) + f(n)) = 2n and, 
analogously, f(2k) = 2m, so that m = n. 

Now, let k > 1 be a natural number. From (k — 1) + 2 = k + 1, it follows that 


fFK-Y)+f2))=kK+1=f(Fk) +f0)). 


By the injectivity of f, we then get f(k — 1) + f(2) = f(k) + fC.) or, which is the 
same, f(k) —f(k — 1) = f(2) —f(), for every k > 1. Writing this relation for 
k =2,3,...,n and adding the corresponding equalities, we arrive at 


f(y) = @-DEFQ)-FU) +f), (6.13) 


for every natural n > 1. Letting n = 2f(1) in the above relation, it comes that 


2=f(fC) +f0)) =fe2fO)) = AFM) — DF) —fA)) +f) 
and, hence, 


4=F(1) 
=i 


Since f(2) — f(1) is an integer, we conclude that 2f(1) — 1 divides 2 — f(1). In 
particular, 2f(1) — 1 < |2—f(1)], and from this inequality it is easy to conclude that 
the only possibility is f(1) = 1, so that f(2) = 2. It thus follows from (6.13) that 
f(n) = n, for every natural n. Oo 


f2)-fW) = 


6.5 


Defining Functions Implicitly 183 


Problems: Section 6.5 


1. 


10. 


* Generalize the discussion of the paragraph that precedes Example 6.55, 
showing that, if 7 C R is an interval and f,g : J — R are functions such 
that f is decreasing and g is increasing, then there exists at most one xo € J for 
which f (xo) = g(x0). 


. Find all positive reals x for which ./2 + /x = g 
. Find all functions f : Q — Q such that 


() _ f@) +f) 
2) oi : 


for allx,y¢Q. 


. (Austria) Find all functions f : Z \ {0} > Q such that, for all x, y € Z \ {0} for 


which x + y is a multiple of 3, we have 


(=) _ f@) +f) 
3 J) 5 


. (Vietnam) Find all functions f : R — R such that 


seo) + she) FY Oe = 7. 


for all x,y,z € R. 


. (Spain) Find all increasing functions f : N > N such that f(n + f(n)) = 2f(n), 


for everyn € N. 


. (OIM shortlist) Find all functions f : R + Z which satisfy the following set of 


conditions: 


(a) f(x +a) =f (x) + a, for every x € R and every a € Z. 
(b) f(f(x)) = 0, for x € [0, 1). 


. (Austrian-Polish) Prove that there doesn’t exist a function f : Z — Z such that, 


for all x, y € Z, we have 


f(x +f) =f@) —y. 


. (Romania) Find all functions f : Z — Z such that f(0) = 1 and 


SFR) + fk) = 2k + 3, 


for every k € Z. 
(Romania) Let k > 1 be an odd integer and A = {x,,x2,...,x,} be a set of k 
real numbers. Find all injective functions f : A — A such that 


f(x1) — x1] = [F(v2) — x2] = +++ = [FP re) — xe. 
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11. (BMO) Let a be a given real number and f : R — R be a function such that 
f(O) = ; and 


fa@t+y) =f@pfla—y) +fofla—, 


for all x, y € R. Prove that f is constant. 
12. Find all functions f : Q > Q% such that f(x + y) = f(@)f(), for all x, y € Q. 
13. Find all functions f : [0, 1] — [0, 1] such that f(0) = 0, f(1) = 1 and 


f(xt+y) +f —y) = 2f@), 


for all x, y € [0, 1] such that x — y,x + y € [0, 1]. 
14. (Lituania) Let f : Z — Z be such that f(m? + f(n)) = f(m)? + 2, for all 
m,n é€ Z. 


(a) Prove that f(0) = O and f(1) = 1. 
(b) Find all such functions. 


15. (OIM) Find all increasing functions f : N > N such that f(yf(x)) = x°f (xy), 
for every x,y € N. 

16. (IMO) Find all functions f : [0,+co) — R satisfying the following set of 
conditions: 


(a) FOF) FO) =f + y), for all x, y € [0, +00). 
(b) f(2) = O and f(x) ¥ 0, for0 < x < 2. 
17. (MO) Let S = {x € R; x > —1}. Obtain all functions f : S > S satisfying the 


following conditions: 


(a) f(x + f(y) + f(y) = y +f) + yf()), for every x,y € S. 
(b) is increasing in each of the intervals (—1, 0) and (0, +00). 


18. (IMO) Decide whether there exists a function f : N — N satisfying the 
following set of conditions: 


(a) f() =2. 
(b) f(n) <f(n + 1), for everyn € N. 
(c) f(f(a)) =f(n) +n, for every n €N. 


19. (Iran) Find all functions f : R — R such that 


FF @+y) =f&+y) +f@OFO) — xy, 


for allx,y € R. 
20. (Poland) Find all functions f : Q*, — Q*. satisfying, for every positive rational 
number x, the following set of conditions: 


(a) fet) =f@ +1 
(b) f0°) = FO). 


6.6 Graphs of Functions 185 
6.6 Graphs of Functions 


Given a function f : X — Y, we define its graph to be the subset Gy of the cartesian 
product X x Y, defined by 


Gy = {@,y) EXXY; y=f@)}. (6.14) 


If f : X — R is a real function of a real variable, so that X C R is a finite 
union of intervals (possibly X = R), then the graph of f has considerable geometric 
importance, for 


Gy CXXYCRXR, 


and this last set can be identified with the euclidean plane, furnished with a fixed 
cartesian coordinate system’. 

Our purpose in these last two sections of the chapter is to examine some 
elementary examples and study some simple properties of graphs of functions 
f :X — R, when X C Ris a finite union of intervals. We postpone the discussion 
of the main properties of graphs of continuous and differentiable functions to 
chapters 8 and 9. 

In all that follows, we fix a cartesian coordinate system is in the plane. 

The first point worth observing is that not every subset of the cartesian plane 
can be the graph of a function. Indeed, suppose f : X — R is a real function of a 
real variable, X being a finite union of intervals. If (xo, yo) € Gy, then, by the very 
definition of graph, x9 € X and yo = f(x). On the other hand, for a fixed x9 € X, if 
Aj (Xo, y1) and A2(x9, y2) are points on the graph of f, then, again from the definition, 
we have 


yi =f (Xo) = ya, 


so that A; = Ap. In short, for a given x) € X, the vertical line x = xo of the 
cartesian coordinate system intersects the graph of f if and only if xy € X; moreover, 
in this case such a line intersects the graph in exactly one point. Thus, the subset C 
depicted in Fig. 6.5 as a continuous curve doesn’t represent the graph of any function 
f : [-3,3] — R, since every vertical line contained in the gray strip intersects C in 
more than one point. 

The graph of a real function of a real variable gives a quite simple geometric 
interpretation for the image of the function. To discuss it, let’s consider the cartesian 
plane of Fig. 6.6, in which the graph of a function f : [a,b) — R is drawn and its 
points of intersection with the horizontal line y = yo are marked. (There are three 
such points in Fig. 6.6, whose abscissas are denoted by a, f and y.) Let (x0, yo) be 


™We refer the reader to Chapter 6 of [4] for an adequate presentation of cartesian coordinate 
systems. 


186 6 The Concept of Function 


Fig. 6.5 A subset of the cartesian plane which is not the graph of a function 


Fig. 6.6 Image x graph 


an intersection point of the line and the graph. By pertaining to the graph of f, the 
point (xo, yo) must be such that x9 € [a, b) and f(x) = yo, so that x9 = a, B or y. 
Conversely, let be given a point (xo, yo) of the cartesian plane, with xo € [a, b). It is 
clear that (xo, yo) belongs to the horizontal line y = yo; moreover, if xo is a solution 
of the equation f(x) = yo, ie., if fo) = yo, then x = a, f or y and we also 
have (xo, yo) € Gy. Therefore, the horizontal line y = yo intersects the graph of f 
exactly when yo belongs to the image of f. The reasoning for an arbitrary function 
f :X > R, with X C R, is entirely analogous and allow us to conclude that 


The image of f is precisely the set of yo € R for which the horizontal line 


y = yo intersects the graph of f. 
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Fig. 6.7 Minimum point of f: 7 —> R 


Now, if 7 C R is an interval, then the monotonicity of a function f : J > R also 
says a lot about the behavior of its graph. For instance, if we suppose that f increases 
(resp. decreases) in J, we conclude that, as long as the x increases in J, the values 
f(x) increase (resp. decrease) in R, so that the graph of f rises (resp. falls). 

On the other hand, if yo € R is the minimum value of f : J > R and xp € Jisa 
minimum point of f (cf. Definition 6.24), then, for every x € J, the point (x, f(x)) is 
above or coincides with the point (x, yo) (cf. Fig. 6.7). Yet in another way, the graph 
of f is entirely contained in the closed upper halfplane determined by the horizontal 
line y = yo, touching it at the point (x0, yo). 

Notice that the concepts of maximum value and maximum point of a function 
f : 1 — R admit geometric interpretations analogous to those discussed above. 

In the rest of this section we shall examine some important examples of graphs 
of functions. 


Example 6.58 Letf : R — R bea constant function, with f(x) = c for every x € R. 
The graph of f is the set 

Gr = {(@, y); x € Rand y=c} = {(@,c); x € R}, 
i.e., it is the horizontal line y = c, which crosses the vertical axis at the point (0, c) 
(cf. Fig. 6.8, where we consider the case c > 0). 
Example 6.59 Recall (cf. Definition 6.2) that the identity function Idg : R > Ris 
given by Idpg(x) = x, for every x € R. Therefore, its graph is the set 


Giap = {(x, y); x € R and y = x} = {(x, x); x € R}. 


It is an easy exercise in euclidean geometry to verify that the points of the cartesian 
plane of the form (x, x) or (—x, x) are precisely those situated on the bisectors of the 
angles formed by the coordinate axes, those of the form (x, x) belonging to the first 
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(0, ¢) Gy 


Fig. 6.8 Graph of the constant function f(x) = c, Vx € R 


Fig. 6.9 Graph of the identity function Idg 


or third quadrants. Hence, the graph of the identity function Idg is the line shown 
in Fig. 6.9, and, from now on, will be called the bisector of odd quadrants. Observe 
that the set of points of the form (x, —x), i.e., the bisector of even quadrants, is the 
graph of the function f : R — R given by f(x) = —x. 


Example 6.60 The modular function is the function f : R > R given by f(x) = 
|x|. It follows immediately from the definition of modulus of a real number that 

Gr = {(, |x|); x € R} 

= {@, |x|); x € Ry} U {@, |x); x € R-} 

= {(x, x); XE R+} U 105 —x); XE R_}. 


Since the points (x, —x) and (x, x) are symmetric with respect to the horizontal axis, 
the graph of the modular function is obtained by reflecting, along the horizontal 
axis, the portion of the graph of the function Idg situated in the third quadrant (cf. 
Fig. 6.10). 


6.6 Graphs of Functions 189 


Fig. 6.11 Graph of the affine function f(x) = ax + b 


Example 6.61 If f(x) = ax + bis an affine function, then its graph is the subset of 
the cartesian plane given by 


Gy = {(, y)3 x,y eR and y= ax + b}. 


According to basic analytic geometry (cf. Chapter 6 of [4], for instance), the graph 
of f is the line of equation y — ax — b = 0, with slope a and passing through the 
points A (—2, 0) and B = (0, b). Fig. 6.11 depicts the graph of f(x) = ax + b in the 
case a,b > 0. 


For what comes next, recall (cf. Chapter 6 of [4], for instance) that, given in the 
plane a point F and a line d, with F ¢ d, the parabola of focus F and directrix d 
(cf. Fig. 6.12) is the locus of the points P in the plane for which 


PF = dist(P, d), 
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Fig. 6.12 Parabola of focus 
F and directrix d 


where dist(P, d) stands for the distance from P to d. The axis of the parabola is the 
line that passes through F and is perpendicular to d, while its vertex is the point V 
where it intersects its axis. 

We now show that the graph of every quadratic function is a parabola. More 
precisely, we have the following result. 


Theorem 6.62 Given a,b,c € R, with a 4 0, the graph of the quadratic function 


f(x) = ax? +bx+c is the parabola whose axis is the line x = -£ and whose vertex 


bf b A z “e a 2. “ce ” 
is V (-, —7). Moreover, it “opens upwards” if a > 0, and “opens downwards 
ifa<0. 


Proof Let’s look for xo, yo, k € R such that yo 4 k and, letting F(x, yo) and d be 
the line y = k, we have 


P&G; => PF = dist(P, d). 
To this end, set P(x, y). We first notice that 
PeG Sy=artbxte 
also, the formula for the distance between two points of the cartesian plane gives 
PF = dist(P,d) => J/(—»)?+0—yy = ly-A. 
Therefore, we want to find xo, yo and k such that 


y=axrt+bxte ° (x — x9)’ + (¥—yo)? = (vy — k)? 
1 Xo mty—k 


2 
y= — - 1 
Ja ae er 
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Hence, it is natural to try to solve, with respect to xo, yo and k, the system of 
equations 


1 XO xe 1 
—— =a, —-— =), —— + 00th) = c. 
2(yo — k) yo—k 2(yo—k) 2 
This, in turn, is immediate: the first two equations readily give x9 = —#; then, 


substituting the first equation and the value of x9 into the third equation, it comes 


that 
+k=2 ; : 2 a = 
= na ee) c-—.: =-— 
- © 0" 36 —®) 4° 2a 


finally, by solving the system of equations 


i +k= 2 
me ~ 2a’ a 9a’ 


we get yo = uA and k = —H4 
To what was left to prove, since the vertex of the parabola is the intersection of 


the line x = -f£ with the graph, its ordinate y equals 


oO 


We finish this section with the coming proposition, which establishes an impor- 
tant relation between the graphs of a bijection and its inverse. Then, we use it to 
sketch the graphs of two important functions. 


Proposition 6.63 Let1,J C R be finite unions of interval. If f : I > J is a bijection, 
then the graphs of f and f—' are symmetric with respect to the bisector of the odd 
quadrants of the cartesian plane. 


Proof Fix a € I and b € J. From the definition of inverse function, we have 
(a,b) € Gr @ b=f(a) 
a=f"'(b) 
> (b, a) € Gy-. 


However, since the points (a,b) and (b, a) are symmetric with respect to the line 
y = x, there is nothing left to do. Oo 
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Fig. 6.13 Sketching the graph of x > ./x 


Example 6.64 Sketch the graph of the square root function 


f : [0, +00) —> [0, +00) 
x > Ke 


Solution We saw at Example 6.49 that f is the inverse of g : [0, +00) — [0, +00), 
given by g(x) = x* for every x € [0, +00). Since we already know the graph of g, 
it follows from the previous proposition tha the graph of f is the symmetric of the 
graph of g with respect to the line y = x (cf. Fig. 6.13). oO 


Example 6.65 Recall that the inverse proportionality function is f : R \ {0} > 
R \ {0}, such that f(x) = 1, for every x € R \ {0}. With the previous discussion 
at our disposal, we can accurately sketch its graph. Indeed, f is clearly decreasing 
in (0, +00); also, we already know from the Problem 10, page 171, that f is odd, 
so that (this time by the Problem 5, page 193) its graph is symmetric with respect 
to the origin of the cartesian plane; on the other hand, since f is its own inverse, 
the previous proposition assures that its graph is also symmetric with respect to the 
bisector of the odd quadrants; finally, we will see later that its graph is a continuous 
curve (i.e., one with no interruptions) which “opens upwards” in (0, +00). 

Since it is intuitively clear (and will be formalized later) that f(x) = + becomes 
more and more close to zero as long as x increases without bound, we arrive at 
the sketch shown in Fig. 6.14. There, besides the above remarks, we plotted the 
auxiliary points (n, +), for 1 <n < 4 integer. 
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Fig. 6.14 Sketching the graph of f(x) = 4 


Problems: Section 6.6 


1. 


5. 


If f : [2, +00) — [1, +00) is given by f(x) = x* — 4x + 5, show that f is a 
bijection and find the intersection points of the graphs of f and f~!. 

For the next problem, given a nonempty set X and a function f : X —> R, 
we say that f is bounded if there exists M > 0 such that |f(x)| < M, for every 
xEXx., 


. * LetZ C R be an interval and f : J > R be a given function. If f is bounded, 


prove that its graph is contained in a horizontal strip of the cartesian plane 
bounded by two parallel lines. 


. *Tf7 C Ris an interval and f : J — / is a given function, show that the fixed 


points of f are the abscissas of the intersection points of the graph of f with the 
bisector of the odd quadrants. 


. * Let? C R be an interval and f, g : J > R be given functions. Explain how to 


analytically identify the intersection points of the graphs of f and g. 

For the next problem, we think the reader will find it useful to read again the 
statement of Problem 10, page 171. 
* Let 7 C R be a union of intervals, symmetric with respect to 0 € R, and 
f :1— Rbea given function. 


(a) Iff is even, prove that Gy is symmetric with respect to the vertical axis. 
(b) Iff is odd, prove that Gy is symmetric with respect to the origin. 


. In each of the following items, sketch the graphs of the given real functions of 


areal variable in a single cartesian system: 
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(a) i@® =x, f(a) = x4 and f(x) = x. 
(b) AG) =x, A(x) =x andf(x) =x. 


7. Sketch the graph of the function f : R > R such that f(x) = </x, for every 
xeER. 
8. Sketch the graph of the integer part function |-| : R — R (cf. Problema 9, 
pagina 152). 
For the next problem, we think the reader may find it useful to read the 
statement of Problem 15, page 171 again. 
9. * Do the following items: 


(a) If,f : R — R is periodic of period p > 0, explain how to draw the graph of 
f by knowing the portion of it in the interval 0 < x < p. 

(b) Use item (a) to sketch the graph of the fractional part function {}: RR 
(cf. Problem 10, page 152). 


10. * Let f : R > R be a given function and a ¥ 0 be a given real number. Prove 
that the graph of: 


(a) g(x) = f( + @) is obtained by translating the graph of f of —a units in the 
direction of the horizontal axis. 

(b) g(x) = f(x) + ais obtained by translating the graph of f of a units in the 
direction of the vertical axis. 

(c) g(x) = —f(x) is obtained by reflecting the graph of f along the horizontal 
axis. 

(d) g(x) = f(—x) is obtained by reflecting the graph of f along the vertical axis. 

(e) g(x) = af(x) is obtained by vertically stretching (resp. compressing) the 
graph of f by a factor a, if a > 1 (resp.0 < a < 1). 

(f) g(x) = f(ax) is obtained by horizontally stretching (resp. compressing) the 
graph of f by a factor a, if a > 1 (resp.0 < a < 1). 


11. * Use the results of the previous problem to sketch the graph of the function 
f :R\ {2} > R given by f(x) = <4, for every x € R \ {2}. 

12. Let/ be an interval and f : / — R be a given function. Which relation does exist 
between the graphs of f and of the function g : J > R, given by g(x) = |f(x)|? 
Apply your conclusion, together with the result of Problem 10, to sketch the 
graphs of the functions listed below: 

(a) g(x) = aI , for every x € R \ {-1}. 

(b) g(x) = |x? — 4], for every x € R. 

(c) g(x) = |x* — |x4+ 2| + 2], for every x € R. 
(d) gx) =1- ae for every real x # 2. 


13. Prove that the graph of the inverse proportionality function is obtained by a 
trigonometric (i.e., counterclockwise) rotation, of + radians, of the hyperbola® 
of equation x? — y? = 2. 


8For the necessary background on the equation of hyperbolas, we refer the reader to Chapter 6 of 
[4], for instance. 
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6.7 Trigonometric Functions 


The sine function is the function sin : R — R, that associates to each x € R the 
sine of an arc of x radians’: 


sin: R—> R 
x > sinx’ 


Analogously, we define the cosine function by 


where cos x stands for the cosine of an arc of x radians. 

Some basic properties of the sine and cosine functions are collected in the 
following proposition, for which the reader may find it useful to recall the statements 
of Problem 15, page 171, and Problem 10, page 171. 


Proposition 6.66 Sine and cosine functions have image {[—1, 1] and are periodic of 
period 2. Furthermore, sine function is odd, whereas cosine function is even. 


Proof Immediate from basic Trigonometry. Oo 


According to the above proposition and the discussion contained in Problem 15, 
page 171, in order to get an accurate sketch of the graph of the sine function, it 
suffices to make it in the interval [—z, x], then copying this portion of the graph to 
each interval of the form [—z + 2k, + 2kz], where k € Z. 

On the other hand, since the sine function is odd, in order to get its graph in 
the interval [—z, z], it suffices to sketch it in the interval [0, 2]; once this is done, 
then, reflecting it across the origin of the cartesian plane, we obtain (from item (c) 
of Problem 10, page 171) the graph in the interval [—z, z]. 

We shall prove in Section 8.1 (cf. Example 8.8) that the graph of the sine 
function is a continuous curve, i.e., a curve with interruptions. On the other hand, in 
Section 9.7 we shall show (cf. Example 9.66) that such a graph “ opens downwards” 
in the interval [0, z]. For the time being, by assuming the validity of these facts, we 
can finally sketch the graph of the sine function. 


Example 6.67 Gathering together the above information on the behavior of the sine 
function in the inteval [0, 2] (image, periodicity, continuity and concavity), together 
with the fact that it is increasing in [0, ZI, decreasing in [3. | and such that 
sin(z — x) = sinx for every x € R, we conclude that, in order to get a reasonably 
accurate sketch of its graph in that interval, we just need to compute the values of 
sin x for some values x € [0, Z|. This is done in the table below: 


°For the necessary background on Trigonometry for this section, we refer the reader to Chapter 7 
of [4]. 


196 6 The Concept of Function 


Fig. 6.15 Graphs of the sine 
and cosine functions 


x |z 
4 


2 [va | Wap? 


aE: ‘ 
0 1 


With all this at hand, we immediately get Fig. 6.15, first on the interval [0, z], 
then on the interval [—z, zr] and finally on R. 
We now recall one more piece of basic Trigonometry, which assures that 


cos sin ( +=) 
CS x+— 
2 


for every x € R. Hence, item (a) of Problem 10, page 194, guarantees that, once the 
graph of the sine function is drawn, we can get a corresponding sketch for the graph 
of the cosine function by translating the graph of the sine function of —> in the 
direction of the horizontal axis. In Fig. 6.15, the graph of the cosine function in the 
interval [—z,, zr] is given by the dashed curve contained in the strip of the cartesian 
plane bounded by the (also dashed) lines y = —1 andy = 1. 


Let us now look at a relevant application of the Ptolemy’s formulae for the sine 
and cosine of the sum and difference of two arcs, which tells us how to proceed to 
study a function given as a linear combination of the sine and cosine functions. 


Example 6.68 Given positive integers a and b, let f : R — R be the function such 
that 


f(x) = acosx+ b sinx, 


for every x € R. Writing 


a b 
H = “/p2 2 j 
acosx+ b sinx = a+b ( a pe cosx + a pe sin) ‘ 


we notice that 
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Fig. 6.16 Defining angle a 


. a b . . . 
Therefore, point P( sa ate) belongs to the portion of the trigonometric 
circle situated in the first quadrant, so that there exists a real number @ € (0, 5) for 


which 
a . b 
cosa = ————__ ands sina = 


(cf. Fig. 6.16). Now, it follows from the formula for the cosine of a difference that 


II 


acosx + b sinx 


f(x) 
= Va’ + b?(cosacosx + sina sin x) (6.15) 
= Va? + b* cos(x — @). 


In particular, since | cos(x — @)| < 1, we get 


lf (x)| = Va2 + b?| cos(x — a)| < Va? + b?, 


and it’s not difficult to use (6.15) to prove that the image of f is precisely the interval 
[—c, c], where c = Va? + b? (in this respect, see Problem 3). 


We close this section with an elementary study of the tangent function, i.e., the 
function that associates, to each real x in its domain, the real number tan x = er 
Since 

TU 
cosx=O08x= em Ik eZ, 
the (maximal) domain of definition of the tangent function is the set 
TU 
D=R\ {5 + ke ke Zh, 
so that the function we wish to study is 


tan: D—> R 
x -K> tanx’ 
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For x € D, we have 


sin(x + 7) —sinx 


tan(x + 7) = = tanx, 


cos(x +7)  —cosx 


and it’s immediate to verify that there doesn’t exist a real number 0 < p < zw such 
that tan(x + p) = tan x for every x € D. Therefore, the tangent function is periodic 
of period z. Furthermore, since D is a subset of R which is symmetric with respect 
to 0 and 


sin(—x) —sinx 
—_ = = —tanx 


tan(—x) = — 
cos(—x) cosx 


for every x € D, we conclude that the tangent function is odd. 

From what was collected above, in order to sketch the graph of the tangent 
function it suffices to do it on the interval [ 0, z). Indeed, the odd character of 
the tangent function assures that its graph on the interval (-%, 0] is obtained 
by reflecting, around the origin of the cartesian system, the portion of it on 
the interval [0, a) . On the other hand, since we have sketched the desired graph 
on the interval (-4. z), the periodicity of the tangent function will allows us to 
sketch the graph of it on all intervals of the form (-3 +k, = + kx), with k € Z: 


—4, 4) of kx units in the direction 


it suffices to translate the graph on the interval ( Fi 


of the horizontal axis, for every k € Z. 

In Section 8.1 (cf. Problem 5, page 254), we shall prove that the graph of the 
tangent function, restricted to the interval (-4. z), is a continuous curve, 1.€., a 
curve without interruptions. On the other hand, in Section 9.7 (cf. Example 9.66) we 
shall establish the fact that the graph “opens upwards” on the interval [ 0, a) . For the 
time being, assuming the validity of these statements and in view of the discussion 
on the previous paragraphs, we can sketch the graph of the tangent function by 
tabulating some values of it on the interval (0, Zz). The result is approximately that 


of Fig. 6.17. 


Problems: Section 6.7 


1. * The purpose of this problem is to introduce the inverse trigonometric 
functions. To this end, do the following items: 


(a) Show that the restriction of the sine function to the interval [-4, Z| defines 


an increasing bijection (which we also denote by sin, whenever there is 


no danger of confusion) sin : [-4, Z| — [-1, 1]; its inverse is the are- 
sine, denoted arcsin : [—1, 1] > [-4, Z|, which is also increasing. Then, 


compute arcsin 4, arcsin 1 and arcsin(—1). 


6.7 Trigonometric Functions 199 


Fig. 6.17 Graph of the 
tangent function 


(b) Show that the restriction of the cosine function to the interval [0, 7] defines 
a decreasing bijection (which we also denote by cos, whenever there is no 
danger of confusion) cos : [0,7] — [—1, 1]; its inverse is the arc-cosine, 


denoted arccos : [—1, 1] — [0, 2], which is also decreasing. Then, compute 
arccos 5, arcsin | and arcsin(—1). 
Show that the restriction of the tangent function to the interval (-4, =) 


defines an increasing bijection (which we also denote by tan, whenever 


there is no danger of confusion) tan : (—4, =) — R; its inverse is the arc- 
tangent function, denoted arctan : R > (3. z), which is also increasing. 
Then, compute arctan 1, arctan /3 e arctan + ee 


= 
fe) 
— 


For item (a) of the next problem, recall that the cotangent of x 4 kz (k € Z) 
is the real number cot x, defined by cotx = =. In particular, cotx = a. 
2. In each of the items below, sketch the graph of the given function: 


(a) f: R\ {ka; k € Z} > R, given by f(x) = cotx. 
(b) f : [-1,1] > [-3, =), given by f(x) = arcsin x. 
(c) f : [-1,1] — [0, z], given by f(x) = arccosx. 
(d) f: R> [—4, 4], given by f(x) = arctan x. 


3. * Let a and b given real numbers, not both zero, and f : R > R be the function 
given by 


f(x) = acosx + bsinx. 


(a) Find the image set of f. 

(b) Prove that f is periodic of period 27. 

(c) Sketch the graphs of f and of the sine function in a single cartesian 
coordinate system. 
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4. 


6. 
7. 


8. 


10. 
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* Let a and b be given real numbers, at least one of which is nonzero, and 
f :R — R be the function given by 


F(x) = acos(Ax) + b sin(Ax), 


where A is a nonzero real number. Show that f is periodic and compute its 
period. 


. Letf : R > R be the function given by f(x) = 2 sinx + cos(2x). 


(a) Compute the maximum and minimum values of f, as well as the real 
numbers x for which f attains these values. 

(b) Show that f is periodic, of period 27. 

(c) Sketch the graph of f. 


(Canada) Compute the number of real solutions of the equation sinx = 75. 
Find the maximum value attained by the function f : [—1, 1] — R, given by 
f(x) = 3x + 475 — x. 

(New Zealand) Let a be a given irrational number. Prove that the function f : 
R — R, defined by 


f(x) = cosx + cos(@x), 


for every x € R, is not periodic. 


. Find all integer values of n for which the function f : R — R, given for x € R 


by 
. (5x 
f(x) = cos (nx) sin (=). 


is periodic of period 37. 
(Canada) Prove that the function f : R > R given by f(x) = sin(x’) is not 
periodic. 


Chapter 7 
More on Real Numbers 


This chapter proceeds with the study of real numbers by presenting the notion of 
convergence for (infinite) sequences and series of real numbers. Among other appli- 
cations, we shall introduce one of the two most important numbers of Mathematics, ! 
the number e. We shall also present a famous result of Kronecker on dense subsets of 
the real line, which will find several interesting applications, here and in the coming 
chapters. 


7.1 Supremum and Infimum 


We begin this chapter by examining the completeness of R from another viewpoint, 
for which we need to introduce some preliminary concepts. 

A nonempty subset X C R is bounded from above if there exists a real number 
M such that 


X C (—o0, M]. 


In this case, we also say that M is an upper bound for X. Similarly, a nonempty set 
X C Ris bounded from below if there exists a real number m such that 


X C [m, +00). 


In this case, we say that m is a lower bound for X. Finally, a nonempty set X C R 
is bounded if X is simultaneously bounded from above and from below. 


‘As the reader probably suspects, the other one is the number z, which is defined as the numerical 
value of the area of a circle of radius 1—see [4], for instance. 
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Yet in another way, a nonempty set X C R is bounded (resp. bounded from 
above, bounded from below) if there exists a positive real number a such that 


-—a<x<a(resp.x<a, x>-a), VxEX. 


On the other hand, a nonempty set X C R which is not bounded (resp. not bounded 
from above, not bounded from below) is said to be unbounded (resp. unbounded 
from above, unbounded from below). In this case, given any positive real number 
a, one can always find an element x € X such that 


x ¢ [—a,a] (resp. x > a, x < —a). 


Examples 7.1 


(a) The set X = {1, 3, i, i: ...} is bounded. Indeed, 0 is a lower bound and | is an 
upper bound for X. 
(b) Bounded intervals (cf. Definition 1.10 and subsequent discussion) are bounded 


sets, in the sense of the above discussion. 


Example 7.2 If anonempty set X C R is bounded from above, then the subset Y of 
R given by Y = {—x; x € X} is bounded from below, and conversely. Indeed, a real 
number a is an upper bound for X if and only if —a is a lower bound for Y. 


In spite of its apparent obviousness, we shall list our next example as an axiom 
of the natural number system, known as its archimedian property, after the Greek 
mathematician Archimedes of Syracuse.” Some important consequences of it are 
collected in Problems 2, 3 and 4, page 206. 


Axiom 7.3. The set N of natural numbers is unbounded from above. 


Continuing with the development of the theory, fix a bounded from above 
nonempty set X C R. If M ¢€ R is an upper bound for X, then X C (—oo, M]. 
Nevertheless, it may happen that there exists M’ < M which still is an upper 
bound for X, i.e., is such that X C (—oo, M’]. Indeed, the condition X C (—oo, M] 
doesn’t guarantee that, for M’ < M, we have XN (M’, M] # Q; if it happens that 
XM (M’, M] = @, then we will have X C (—oo, M’], and M’ will be another upper 
bound for X, which is less than M. 

On the other hand, given x € X, it is obvious that no M’ < x is an upper bound 
for X, since x € X \ (—co, M’], i.e., X Z (—oo, M’]. To put in another way, the set 
of upper bounds of a nonempty, bounded from above set X C R is bounded from 
below. 


? Archimedes, who lived in the III century b.C., was the greatest scientist of his time. Among many 
important contributions to Mathematics and Physics, his seminal ideas on how to compute the 
area under a parabolic segment anticipated, in some 2000 years, the development of the Integral 
Calculus by Newton and Leibniz. 
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As we shall see later (cf. Problem 9, page 242), the completeness of R, as 
postulated in Sect. 1.3, is a consequence of the following statement, which sharpens 
the discussion of the two previous paragraphs and will be taken as an axiom. 


Axiom 7.4 [fanonempty set X C R is bounded from above, then there exists a real 
number M satisfying the two following properties: 


(a) M is an upper bound for X. 
(b) If M’ <M, then M' is not an upper bound for X. 


In the hypotheses and notations of the previous axiom, a moment’s thought shows 
that there exists at most one real number M satisfying the properties there stated. 
Indeed, if two distinct real numbers M, and M) did the job, suppose, without loss 
of generality, that M; < M2. Then, on the one hand, item (a) would guarantee that 
M, and M) are upper bounds for X; on the other, since M; < Mo, item (b) would 
guarantee that M, is not an upper bound for X. 

Yet in the hypotheses and notations of the axiom, the discussion of the last 
paragraph allows us to say that M is the least upper bound or supremum of X. 
We denote M = sup(X) or M = lub(X). 

Examples 7.5 

(a) The set X = {1, 5. i, i ...} has | as an upper bound. On the other hand, since 
1 € X, no real number less that 1 can be an upper bound of X, so that sup(X) = 
1. 

(b) If X = (1,2) (an open interval), then 2, but no real number less than 2, is an 


upper bound for X. Indeed, if 1 < a < 2, then the number ia is also greater 
than 1 and less than 2, so that ita e€ X. Now, since a < ite € X, the number 


a cannot be an upper bound for X. Therefore, sup(X) = 2, and observe that 
2€xX. 


If 6 # Y C R is bounded below, then one can prove, as an easy consequence of 
Theorem 7.4 (cf. Problem 6), that Y admits a greatest lower bound m, which is 
also said to be an infimum of Y. As in the case of nonempty, bounded above sets, 
one can easily prove that a nonempty, bounded below set Y has a unique greatest 
lower bound m; hence, we denote m = inf(Y) or m = glb(Y). 

For future use, the coming results collect some useful properties of the notions 
of sup and inf. 


Proposition 7.6 Let X C R be a nonempty, bounded above set, and M = sup(X). 
Ifn €N, then there exists x € X such that 


1 
M-—-<x<M. 
n 


Proof Since M is the least upper bound of X and M— 4 < M, the real number M— 4 
is no longer an upper bound of X. Hence, there exists x € X such that x > M — 1 
However, since X C (—co, M], we must also have x < M. oO 
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Problem 10 states an analogous result for the greatest lower bound of nonempty, 
bounded below sets. 


Proposition 7.7 Let X,Y C R be nonempty sets. If x < y for allx € X andy € Y, 
then X is bounded above, Y is bounded below and 


sup(X) < inf(Y). 


Proof Fix y € Y arbitrarily. Since x < y for all x € X, the number y is an upper 
bound for X. Hence, X is bounded above and, letting M = sup(X) (the /east upper 
bound of X), we have M < y. 

Since y € Y was arbitrarily chosen, the reasoning of the previous paragraph 
shows that, for all y € Y, we have M < y. Therefore, M is a lower bound for Y, so 
that Y is bounded below and, letting m = inf Y (the greatest lower bound of Y), we 
have M < m. oO 


The next result gives a sufficient condition for equality to happen in the previous 
proposition. 


Proposition 7.8 Let X,Y C R be nonempty sets, such that X is bounded above, Y 
is bounded below and sup X < inf Y. If, for every n € N, there exist x, € X and 
Yn € Y satisfying yy — Xn < 1 then sup(X) = inf(Y). 


Proof Let M = sup(X), m = inf(Y) and suppose that M < m. Sincex < M<m< 
y for all x € X and y € Y, we would have y — x > m—M, forallx € X andy € Y. 

On the other hand, by choosing a natural number n > — (which is possible, 
thanks to Axiom 7.3), our hypotheses would guarantee the existence of real numbers 
X, € X and y, € Y such that 


1 
Vn —Xn < —- <m—M, 
n 
which is a contradiction! 
Finally, since the assumption that sup(X) < inf(Y) leads us to a contradiction, 
the only possibility is that sup(X) = inf(Y). Oo 


We finish this section by presenting a nontrivial (and important) application of 
the concept of supremum of a bounded above nonempty set. To set the stage, we 
recall that, in the previous chapters, we several times relied upon the existence of 
square roots of positive real numbers. The next result establishes existence in a more 
general setting. 


Theorem 7.9 Given a positive real number x and a natural number n > 1, there 
exists a unique positive real number y such that y" = x. 


Proof Let’s consider just the case x > 1, referring the reader to Problem | for the 
case 0 < x < | and observing that the case x = 1 is trivial. If 


X = {ae R; a> 0 anda” < x}, 
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then X is nonempty, since 0 € X. Also, X is bounded above, for, ifa > x + 1, 
then Problem 6, page 176, together with item (b) of Corollary 1.3, guarantees that 
a” > (x+ 1)" >x+1> 4x; therefore, a ¢ X and, thus, X C (—oo,x + 1). 

Being nonempty and bounded above, X has a least upper bound, say y. Since 
1” = 1 < x, we have | € X and, thus, y > 1. We shall show that y” = x in the 
following way: 


(i) If y" < x, we shall obtain a positive real number z such that y” < 2” < x. 
Then, the first inequality will give y < z, whereas the second (with the aid of 
the result of Problem 6, page 176) z € X. Therefore, we will get y < z € X, 
thus contradicting the fact that y is an upper bound for X. 

(ii) If y” > x, we shall obtain a positive real number z for which x < z” < y”. Then, 
taking an arbitrary a € X, it will follow from a” < x < z” < y” (again by the 
result of Problem 6, page 176) that a < z < y, so that z will be an upper bound 
for X which is less than y. This will contradict the fact that y is the least upper 
bound for X. 


Once we have eves that y” < x and y” > x both lead to contradictions, the only 
possibility left will be y” = x. 

For the proof of (i), suppose y” < x.Ifz=y+4 z with k € N, then, once more 
by the result of Problem 6, page 176, we have y” < 2 We assert that z” < x for all 
sufficiently large k. Indeed, the binomial formula (cf. Theorem 4.20) gives 


however, since (| ’) Ca ee B<5 + andy"! < y" for 1 <j <n, it follows that 


n(2y)” 


a “ 2" A A 
gy ee, = 


j=l 
therefore, we have z” < x provided y” + aoe <x,ie.,k > a 


For the proof of (ii), suppose y” > x. Letting Z=y- with k €N, then, as 
in the previous paragraph anterior, we have y” > z”. We dla that z” > x for all 
sufficiently large k. To this end, arguing as above we have 


nan o 


—x 


210" > x, ie., k> 


hence, z” > x whenever y” — 


For another, conceptually simpler, proof of the shoe theorem, we refer the reader 
to Example 8.36. 
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Problems: Section 7.1 


1. * Yet in respect to roots of positive real numbers, do the following items: 


(a) If 0 < x < 1 andn > 1 is natural, show that the real number aE (whose 


existence is guaranteed by Theorem 7.9, since + > 1) is the n-th root of x. 
(b) Givena,b > Oandm,n > | naturals, show that </ab = 2/av/b, fe — 


and m/q = ta. 


(c) Given 0 <a < bandn > | natural, show that 2/a < Vb. 

(d) Given a > 0 andm > n > 1 naturals, show that */a > */aifa > 1, and 
Ya < aif0<a<1. 

(e) Let x < 0 be a real number and n € N be odd. If y = —2/—x, show that 
y” = x (the real number y is also called the n-th root of x) and extend the 
properties of items (b)—(d) to this case. 


2. * Use the archimedian property of the set of natural numbers to prove the 
following items: 


(a) If a € Ris such thatO <a< 1 for every n € N, thena = 0. 
(b) If a,b,c € R, with a > 0, then there exists n € N for which an + b > c. 


3. * Let a and b be given rationals, with a < b. Prove that: 


(a)a< wee 


(b) The saree (a, b) contains infinitely many rational numbers and infinitely 
many irrational numbers. 


<banda<a+ <b. 


4. * The purpose of this problem is to generalize the result of the previous one, 
showing that between any two given real numbers there is always a rational 
number and an irrational number (thanks to these properties, we say that Q and 
R \ Q are dense in R). To this end, let a and b be given real numbers, with 
a<b. 


(a) Show that it suffices to consider the case a > a 


(b) Prove that there exists n € N such that 0 <5 <b-—aand0< a2 <b—-a. 
(c) Letting a > 0en € N be chosen as in (b), show that one of the numbers 


Aa Bo SD 


-,=,=,... and one of the numbers .. belong to the interval 


5. * A rational number r € [0, 1] is said to be dyadic if there exist k,n € Z such 

that 0 <n < 2‘ andr = eS Prove that the set of dyadic rationals is dense in 
[0, 1], i.e., that for every a € [0,1] and € > 0, there exists a dyadic rational in 
the interval (a—¢€,a+¢). 

6. * If Y C R is nonempty and bounded below, prove that it has a greatest lower 
bound. 


7. LetX = {xE€Q,0<x< lh}andY = {ye R\Q; 0 <y <1}. Prove that 
inf(X) = inf(Y) = 0 and sup(X) = sup(Y) = 1. 


TA 


(oe) 


10. 


11. 


12. 
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. 1X = {| /a— Vb]; a,b € N and a F b}, compute inf(X). 
. Let 


Ci = (0,1) 

Co = (0,1) \ (1/3, 2/3) = (0, 1/3] U [2/3, 1) 

C3 = ((0, 1/3] \ A/9, 2/9)) U (2/3, 1) \ (7/9, 8/9) 
= (0, 1/9] U [2/9, 1/3] U [2/3, 7/9] U [8/9, 1). 


More generally, for each n € N, obtain C,4, from C,, by erasing the open 
middle third of each of the intervals that form C,,. If C = U,,.1 Cn, show that 
inf(C) = 0 and sup(C) = 1. 

* Let Y C R be a nonempty, bounded below set, with m = inf Y. Ifn € N, 
prove that there exists y € Y such that 


n>1 


1 
msy<m+-. 
n 


Let X,Y C R be nonempty sets, such that X is bounded above, Y is bounded 
below and sup(X) = inf(Y) = a. Ifa ¢ X U Y, prove that there exist elements 
X, € X and y, € Y for which y, — x, < i, for everyn EN. 

* Let X C R bea nonempty, bounded above set. Givenc € R, let cX = {cx; x € 
X}. Prove that: 


(a) Ifc > 0, then sup(cX) = csup(X). 
(b) If c <0, then cX is bounded below and inf(cX) = c sup(X). 


Then, if X is bounded below, establish properties analogous to the ones listed 
above, relating inf X to the sup or the inf of cX, according to whether c < 0 or 
c>0. 


. *Let X,Y C R be nonempty sets and X + Y = {x+ y; x € X and ye Y}. 


(a) If X and Y are bounded above, prove that X + Y is bounded above and 
sup(X + Y) = sup(X) + sup(Y). 

(b) If X and Y are bounded below, prove that X + Y is bounded below and 
inf(X + Y) = inf(X) + inf(Y). 


. * Let X,Y C [0,+00) be nonempty, bounded above sets. If XY = {xy; x € 


X andy € Y}, prove that XY is bounded above and such that sup(XY) = 
sup(X) - sup(Y) and inf(XY) = inf(X) - inf(Y). 


. (Hungary) Let (R,)»>1 be a sequence of pairwise distinct rectangles in the 


cartesian plane, each of which having all vertices with integer coordinates and 
two sides along the axes. Prove that one can find two of them such that one 
contains the other. 


. (IMO) Let f, g : R > R be functions satisfying, for all x, y € R, the relation 


fet+y) +fa-—y) = 2f@eg). 


If f is not identically zero and |f(x)| < 1 for every x € R, prove that |g(x)| < 1, 
for every x € R. 
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7.2 Limits of Sequences 


Given a sequence (a;,),>1 of real numbers, we are interested in recognizing whether 
or not their terms are approaching a certain real number /, as n increases. For 
instance, if a, = 4, it is reasonable to say that the numbers a, become closer and 
closer to 0 as n increases, since the result of the division of 1 by n is increasingly 
smaller as n increases. This naive point of view is formalized as follows. 


Definition 7.10 A sequence (a,),>1 of real numbers converges to a real number / 
if, given an error € > 0 for the value of /, there exists an index np € N such that 
la, —1| < € for every n > no. 


Alternatively, if the sequence (a,),>1 converges to /, we say that it is convergent 
and that / is a limit of the sequence, which we denote by writing 


Gas or lim a, =1. 
n—>+0o0 
Finally, a sequence which is not convergent is said to be divergent. 

In general, if we diminish the error € > 0 and the condition “Ja, — l| < ¢€ for 
every n > no’ is to continue holding, then the natural number no in the definition 
of convergent sequence tends to increase. In other words, in general no depends 
on € > 0. Anyhow, what is important to assure the convergence of the sequence 
(@n)n>1 is that, for an arbitrarily given error € > 0, we are capable of finding np € N 
such that 


n> no => la, —l1| <e. 


For the reader to get used to the important concept of convergent sequence, we 
collect below some elementary examples of convergent and divergent sequences. 


Examples 7.11 


(a) If a, = 4, then a, —“. 0: indeed, for a given « > 0, we have |a, — 0| < € 
provided n > i; thus, once we have chosen no € N such that np > 1, we will 
have |a,, — 0| < € whenever n > no. 

(b) Ifa, = (—1)”, then (a,),>1 is divergent: indeed, since the terms of the sequence 
are alternately equal to 1 and —1, it is impossible for them to (collectively) 


become closer to a single real number / (formalize this intuition). 

(c) Ifa, = 1+ cy. then ay —"s 1: this is so because la, —1| = 1, so that 
la, —1| < € forn > 1. 

(d) If (a,)n>1 is a constant sequence, with a, = c for every n > 1, then a, > c. 


Example 7.12 If an = q", with 0 < |g| < 1, then a, —> 0. 
Proof Since a > 1, we can write ul = 1+ a, with a > 0. Therefore, taking the 


first two terms in the binomial expansion formula, we get 


1 
q\" 
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and, hence, 
an — O| = Iq|" < ‘ 
lan — 0| = |g" < —— 
Thus, if we wish that |a, — 0| < e, it suffices to impose a < € or, equivalently, 
na 
je Liles, n 
a € 


Example 7.13 The sequence (a,)n>1, given forn > 1 by a, = J/n+1— Jn, 
converges to 0. 


= 1 : : 
Proof Note that a, = Tatit ae Thus, given € > 0 and taking no € N such that 
No > a. we have 


1 
n>n => Vn+t1l+JSn> Van t1+ Sno > 2/1 > -. 
€ 


Therefore, 
1 
= ——_- < € 
Jvn+1+ fn 


n> no => |an — 0} 


oO 


The notion of convergent sequence doesn’t make it clear whether the correspon- 
dent limit is unique. Yet in another way, in principle it could happen that a certain 
sequence converges to more than one limit. The coming result shows that this is not 
so. 


Proposition 7.14 If the sequence (dy)n>1 converges, then its limit is unique. 


Proof Let 1; and I, be distinct real numbers, and suppose that the given sequence 
simultaneously converges to /; and J). Toking € = sh — | > 0, the definition of 
convergence guarantees the existence of n;,2 € N such that 


n>n, => |a,—h|<¢€ and n> nj => |a,—h| <e. 
Therefore, triangle inequality gives 
n> max{n,, 12} > |; — | < lan = l;| + lan —1| <2e= Ly = bl, 


which is an absurd. oO 


Thanks to the previous result, from now on we speak of the limit of a convergent 
sequence. In this respect, the next proposition collects two basic, albeit very 
important, properties of limits of convergent sequences. In order to state it properly, 
we define a subsequence of a sequence (d,),>1 as the restriction of the given 
sequence to an infinite subset N} = {n; < m2 < n3 < ---} of N; in this case, 
we denote it by (dn,)xen. Since the function j +> n; from N, to N is a bijection, 
every subsequence can actually be seen as a sequence. 
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Proposition 7.15 Let (a,),>1 be a convergent sequence, with limy++4o0 dn = I. 
Then: 


(a) If a, = a (resp. An < a), for every n = 1, then! = a (resp. 1 < a). 


(b) Every subsequence (ay,)x>1 Of (an)n>1 also converges to I. 
Proof 


(a) Suppose that a, => a for every n = 1, and let’s show that / > a (the other case 
is completely analogous). By contradiction, if 1] < a, take ¢ = a—I1 > 0. The 
definition of convergence guarantees the existence of an index mg € N such that 
n> no => |an — | < €; in particular, given n > no, we have 


an<l+e=Il+(a-)=a, 


which is an absurd. 

(b) Let € > 0 be given. Since a, ms 1, there exists a natural number ng such that 
la, —1| < € forn > no. Since ny < np < n3 <---, there exists an index n; such 
that nj > no for j = i; hence, for all such j, we have |a,, — 1| < €, which is the 


: k 
same as saying that a, —> l. oO 


In words, item (b) of the previous proposition says that, if the terms of a certain 
sequence come closer and closer to / as their indices increase, then the same is true 
for the terms of every subsequence of the given sequence. Item (b) of the previous 
proposition also has the following immediate corollary, which gives us a sufficient 
(and quite useful) condition for the divergence of a sequence. sequéncia. 


Corollary 7.16 Jf two subsequences of a given sequence converge to distinct limits, 
then the original sequence is divergent. 


Up to now, except for some very simple examples we haven’t seen how one could 
find out the limit of a convergent sequence. In order to remedy this situation, we 
need to understand how to perform simple arithmetic operations with convergent 
sequences. We turn to this next, starting with an auxiliary result which is important 
in its own. 

We say that a sequence (a,),>; is bounded (resp. bounded from above, 
bounded from below) if the set {a), az, .. .} is bounded (resp. bounded from above, 
bounded from below), in the sense of the previous section. 


Lemma 7.17 Every convergent sequence is bounded. 


Proof Vf (an)n>1 1s a convergent sequence with limit /, then there exists 9 € N such 
that 


n> No => |an—1| <1. 
This, together with the triangle inequality, gives 


n> no => |ay| < lan —1 + [l] < 14 (J. 
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Finally, letting L = max{1 + |al, |ai|, |a2|,..., |anp—i|}, we get |an| < L for every 
n €N, so that the sequence is bounded. oO 


Proposition 7.18 Let (a,)n>1 and (bn)n>1 be convergent sequences, and c be any 
real number. 


(a) If ay pasar a, then can — + ca. 

(b) If ay as aand by, as b, then an + by past ax band aybn ay ab. 

(c) If dy —> 0 and (by)n=1 is bounded, then ayby, —> 0. 

(d) If ay sa and by yy b, with b, b, 4 0 for every n > 1, then - a : 
Proof 

(a) If c = 0, then ca, = ca = 0, and there is nothing to do. Suppose, then, that 


c # 0, and let € > 0 be given. Since a, as a, there exists no € N such that 
n> No => lan —al| < Ta Hence, 


€ 
n > No => |cay — cal = |c||a, —a| < |c|- — =e. 


Ic| 


(b) For the first part, let’s prove that a, +b, mle +b (to prove that a,—b, — a—b 


. ° n n 
is completely analogous). Given e > 0, the convergences a, —> a and b, —> b 
assure the existence of n,,n2 € N such that 


€ € 
Be HE an Al and Been SO) Se: 


Therefore, taking n > max{n,,n2}, we get 
€ € 
\(an + bn) — (a + b)| < lan — | + |b, — DI < a se 


For the second part, let L > 0 be such that |b,| < L for every n € N. Given 
€ > 0, take no € N for which 


€ € 
Sapa Sa) Stand (by She ——— 
a aaa ee I< Fas 


Then 
|anby — ab| = |dyby — aby + aby — ab\ < |ay — a\lbn| + |al|Pn — | 
€ € 


€ € 
<—-DL+ »——_ < -4+-=€. 
OL led 279 


(c) Let L > 0 be such that |b,| < L for every n > 1. Given € > 0, let’s take np € N 
such that 


€ 
n> no => lanl < F- 
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Then, 


€ 
n> No => |anbya — O| = |an||Dn| < eo €. 


(d) By the second part of item (b), it suffices to show that x mur i: To this end, 
start by observing that 


b, b 


(BL [bal [B] [b) = bn = BP 


1 , 1 [bn-bl 1 |b, — b| 


where we used the triangle inequality in the last passage above. Now, given 
€ > 0, choose ng € N such that 


=> |b, —b| < é Jo 
> n— ==. 
eis ae 
Then, for n > no, we have 
1 1 1 |by — Dl 
——-| < —-—_ _ =) |b, —b| <e. 


oO 


Example 7.19 Let a be a positive real number. If (ay)n>1 is given by a, = %/a for 


n 
every n > 1, thena, —> 1. 


Proof Ifa> 1, then a, > 1. Write a, = 1+ by, so that b, > 0. Since 


n 


oman crsotzr4(’ 


)p =1+nb,, 


we get 0 < by, < —s Hence, the squeezing principle (cf. Problem 6) guarantees 


that b, —~> 0, and item (b) of Proposition 7.18 gives dn = 1 +b, —> 1. 


1 wt 


If0<a<l,letd,=—= so that a, —* 1 by the first part. Then, item 


an a’ 


(d) of Proposition 7.18 gives that a, sl, oO 


Example 7.20 The sequence (a,),>1, given by a, = %/n for every n > 1, converges 
to 1. 


Proof As in the previous example, write a, = 1+ b, forn > 2. Since b, > 0, 
we have 


-1 
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so that 


i222. 
n 


Hence, once more from the squeezing principle, the above inequality gives b,, +0 
and, thus, a, = 1+ b, —> 1. o 


For what comes next, recall that a sequence (d,)n>1 of real numbers is just a 
function f : N > R, for which we write a, = f(n). Hence, it is natural to say that 
(4n)n>1 iS Monotonic increasing (resp. decreasing, nondecreasing, nonincreasing) 
provided ay, < dp (esp. dn > An+1,4n < An+1; An = Gn+1), for every n > 1. 


The most important result on limits of sequences is the theorem below, which is 
known in the mathematical literature as Bolzano-Weierstrass theorem.° 


Theorem 7.21 (Bolzano-Weierstrass ) Every monotonic bounded sequence is 
convergent. 


Proof Suppose that (a,,),>1 is a nondecreasing bounded sequence (the other cases 
can be dealt with similarly), i.e., that 


a, 5a<5a3<:::<M, 


for some M > 0. Then, M is an upper bound for the set A = {a1, a2, 43,...}, so that 


A has a sup, say supA = 7. We claim that a, ef Indeed, let € > 0 be given; since 
1 — € is no longer an upper bound for A, some element of it is greater than / — e, 
say, dn, > 1 —€. Therefore, since dy.) < Any+1 < Gny+2 < +++, we conclude that 
a, > l—e for every n => no. Thus, for n > no, we have 


l-€<a,<Il<l+e, 


as we wished to show. oO 


The previous theorem, together with the definition of convergence, assures that if 
a bounded sequence is monotonic from a certain term on, then it will be convergent. 
We explore this comment by revisiting the last two examples. 


Example 7.22 Let a be a positive real number. If (a,),>1 is given by a, = %/a for 


n 
every n > 1, thena, —> 1. 


Proof Assume a > | (the case 0 < a < 1 can be dealt with as in the Example 7.19). 
Then, a; > ay > a3 > --- > 1, and the Bolzano-Weierstrass theorem guarantees 
the existence of / = lim,++4 0d, > 1. Item (a) of Proposition 7.15 gives | > 
1, and item (b) guarantees that every subsequence of (a,),>1 also converges to 1. 


3 After Bernard Bolzano and Karl Weierstrass, German mathematicians of the XIX century. 
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a 


k . k : : 
Therefore, axy.+1) —> I. Now, since agg4i) = a it follows from item (d) of 


2 Va 
Proposition 7.18 that 
Ya «i 
Ak(k+1) — “Va _ 7 — 1. 
Therefore, / = 1. oO 


Example 7.23 The sequence (a,)n>1, given by a, = «/n for every n > 1, converges 
to 1. 


Proof The initial terms of the sequence are /2, V3, V4, ..., and it is easy to 
directly show that V2 < /3and 73 > V4 > SS. Since 2” > rn forn > 4 (by 
induction, for instance), we get a2 > a, > | forn > 4, so that the sequence is 
bounded; hence, if we show that it is indeed decreasing from the third term on, its 
convergence will follow from Bolzano-Weierstrass theorem, with limit / > 1. 

For what is left to do, for an integer n > 2 we have 


1 n 
wn> "Vaal ont! > nt iy n> (142) : 
n 


Let’s prove the last inequality above. For n = 3, it’s immediate to check it 
numerically; for n > 3, it suffices to show that (1 + iy" < 3. To this end, notice that 


je |) ede |) re fag 
n) l}n 2) n2 n} n® 


and 
n\ 1 n! 1 n(n—-1)...m~—k+1) 1 2 1 
—- =O -_CSar OC OC OOOO SC—CFCA< OC —_—_—. 
k} nk kM(n—k)ink ik! nk kl ~ 2k-1 
Therefore, 


1. i 
eee ee ee ee 
qlee es 


In order to finish, we need to show that / = 1. To this end, first observe that the 
subsequence ax, = 2k also converges to /. On the other hand, V2k = */2- v Vk, 
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with %/2 aty 1. Now, it follows from Problem 2 that Lk an JI. Therefore, by 
applying item (b) of Proposition 7.18, we get 


l= lim VW2k= lim Y2- lim VVk= vi, 
k->+00 k+>+00 


k>+00 
oO 


Sometimes, we have to show that a given sequence has at least a convergent 
subsequence (even if, as a whole, the sequence does not converge). In this sense, 
Theorem 7.25 below, also due to Weierstrass, provides a sufficient condition for the 
existence of such a subsequence. Before we state and prove it, we need to discuss an 
important auxiliary result, known as the lemma of nested intervals. In what follows 
if J = [a, b], with a, b € R, we shall let |J| = b—a. 


Lemma 7.24 For n € N, let I, = [an,dn). If) D bh D bk D ... and 
limy—++00 [In| = 0, then there exists a unique 1 € R such that (ibs I, = {lh. 


Proof First of all, note that the intersection of the intervals J,,, if not empty, has a 
single element; indeed, if there existed reals a < b in such an intersection, we would 
have [a,b] C (),,5, Jn; in particular, [a,b] C I, and, hence, |J,| > b — a for every 
n € N, thus contradicting the fact that limy+ +0 |J,| = 0. 

Secondly, the inclusions D hh D kh D... giveaq, < a < a3 <::: 
<.+--<b3 < bo < bj, and the Bolzano-Weierstrass theorem assures the existence 
of J = limp++00 dn. We claim that 1 € (7,5) In. 

Since / = sup{a,;n € N}, it follows that a, < 1 for every n € N. On the 
other hand, for a fixed m € N, we have a, < b, for every n € N, and item (a) 
of Proposition 7.15 gives 1 = limp++00d, < bm. However, since m was chosen 
arbitrarily, we have / < b,, for every m € N. It thus follows that / € [am, bm] = In 
for every m € N, as we wished to show. Oo 


We are now in position to prove Weierstrass theorem. 


Theorem 7.25 (Weierstrass ) Every bounded sequence admits a convergent sub- 
sequence. 


Proof Let (dn)n>1 be a given bounded sequence, and Jp = [a,b] be a closed and 
bounded interval containing all of its terms. One (possibly both) of the intervals 
[a, ath) and [4 : b], call it 7;, also contains infinitely many terms of the sequence. 
Do the same with /,, obtaining a closed and bounded interval J, C J; such that |/>| = 
$\| and / contains infinitely many terms of the sequence (a,),>1. Proceeding 
inductively, we construct a nested sequence J) D Ih D Ik D --:: of closed and 
bounded intervals, such that |/;41| = $k and J; contains infinitely many terms of 
the sequence (a;,)n>1, for every k > 1. Therefore, by the lemma of nested intervals, 
there exists c € R such that (),., J = {c}. 

Now, choose 7, € N such that an, € 1; then, after having chosen n; € N such 
that An; € J;, choose nj+1 € N such that nj+,; > nj and Anjpy Jj41 (this is possible 
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by the way the J;’s were defined). This way, we inductively construct a subsequence 
(Gn, )k>1 Of (Gn)n>1, Such that a, € J, for every k > 1. Since |K| = 3 Io| and 
: k 
an, c € Iz, we conclude that |an, —c| < (lol, for every k > 1; since x (Io| — 0, 
k 

we conclude that a, — c. oO 

The concept of convergent sequence gives a precise meaning to the geometric 
intuition that the terms of the given sequence come closer and closer to a certain 
real number (the limit of the sequence), as long as their indices increase. However, 
it is also reasonable to expect that, if the terms of a given sequence come close 


together, then the sequence should also converge. This is indeed the case and, in 
order to establish it, we start with the following 


Definition 7.26 A sequence (a,),>1 is said to be a Cauchy sequence if, for every 
€ > 0, there exists ng € N such that 
m,n > No => |dm — an| < €. 
The fundamental result concerning Cauchy sequences is the content of the 
following 
Theorem 7.27 A sequence (a,)n>1 is convergent if and only if is Cauchy. 


Proof Let (an)n>1 be a convergent sequence, with limit /. Given € > 0, the definition 
of convergence guarantees the existence of m9 € N such that 


€ 
n>no => |a,—l| < x=. 
2 

Hence, given naturals m,n > no, the triangle inequality gives 


€ € 
lam — Qn| < |Qm — 1] + lan -—I < = +5 =, 
2 2 
and the sequence is Cauchy. 
Conversely, let (@;)n>1 be a Cauchy sequence. Then, there exists 79 € N such 
that |a,, — dn| < 1 for m,n > no. In particular, |am — an,41| < 1 for every m > no, 
and the sequence has all of its terms contained in the set 


{41,42,---, Ang} U (Ang+1 — 1, 4ng+1 + 1), 


so that it is bounded. Hence, by the theorem of Bolzano-Weierstrass, the sequence 


k 
(Gn)n>1 has a convergent subsequence, say, ay, —> J. Let us prove that, actually, 
An aS i 
Given € > 0, there exists No € N such that 
€ 


ne > No = lan — | < 5. 
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On the other hand, since the sequence is Cauchy, there exists N, € N such that 


€ 
m,n > N, => |dm — an| < % 


Letting M = max{N}, N>} and fixing n, > M, we have, for n > M and with the aid 
of the triangle inequality, 


é 


€ 
lan — 1 Ss lan — Any,| + lan, — 1 < at 5 


= €, 


as we wished to show. Oo 
The coming example collects an interesting application of the above result. 


Example 7.28 Let (dn)n>1 be a sequence of real numbers such that 


lan42 = an+1\ < C\dn+1 — an 


for every n € N, where 0 < c < 1 is a real constant. Show that this sequence is 
convergent. 

Proof By Theorem 7.27, it suffices to show that (a,)n>1 is a Cauchy sequence. To 
this end, iterating the inequality in the statement we get, for every k € N, 


lax41 — a| < car — al]. 


Let n and p be given natural numbers. The above inequality, together with the 
triangle inequality, gives 


n+p-1 n+p-1 
lant¢p—Gnl S D> lati -ael << DS can —ay 
k=n k=n 
cl —citp 
= | ———— ] |a -a 
(=) faa - a 
1 
< — + |a,—a,\c"!. 
c 


_ 


We now note that, by Example 7.12, the last expression above tends to 0 when 
n — +o. Therefore, given € > 0, there exists 9 € N such that “ lay—a,|c""! < 
€ for every n > no. Hence, forn > no and p € N, we get |dn4p) — an| < €, so that 
(4n)n>1 is, indeed, a Cauchy sequence. oO 
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Problems: Section 7.2 


1. * Let (dy)n>1 and (bn)n>1 be convergent sequences of real numbers, with 
limp++ood, = a and lim,++0b, = b. Generalize item (a) of Proposi- 
tion 7.15, showing that if a, < b, for every n => 1, thena < b. 

2. * Let (dy)n>1 be a sequence of positive real numbers converging to a > 0. Show 

that /a, tia, 

. * Given a € R such that |a| > 1, show that £ — 0 when n > +00. 

4. Generalize the result of Example 7.12, showing that, given k € N anda eR, 


OW 


with |a| > 1, we have aw +0. 
5. Let (dn)n>1 and (b,)n>1 be sequences of real numbers and, for each n € N, let 
t, € [0, 1] be given. Denote by (c,),>1 the sequence defined by 


Ch = (1 — tn)An + thbn, 


for every n EN. If ay, by a 1, show that c, mare | 

6. * Prove the squeezing theorem: let (4,),>1, (bn)n>1 and (Cn)n>1 be sequences 
of real numbers such that a, < b, < cy, for every n € N. If ay, cy —s 1, for 
some / € R, show that b,, einer a 

7. Compute the following limits: 


n/n 
n 


(a) lim,-+ 400 one 

(b) lim,+400(W/n2 + an +b —n), witha, b € R. 

(c) limy++oo */1 + q", where 0 < g < 1 is areal number. 

(d) limy—++00 a" +b", with a and b positive reals such that a > b. 


8. * This problem extends the concept of limit of sequences to consider infinite 
limits. We say that a sequence (a,,),>1 of real numbers converges to +00 (resp. 
—oo) if, given M > 0, there exists m9 € N such thatn > no => a, > M (resp. 
ay, < —M). In this case, we denote limy++0 dy = +00 (resp. liMp++o0 Gn = 
—oo), or simply a, —> +406 (resp. dy > —oo). With respect to this concept, 
and given sequences (dy)n>1 and (by)n>1 Of real numbers, do the following 
items: 

(a) If a, —s +00 and (by)n>1 is bounded, then a, + b, —"s +00. 

(b) If ay —" +oo and by, = c > 0 (resp. b, < c < 0) for every n => 1, then 
AnDn —"s +00 (resp. a,by, > oo). 

(c) If b,, ae +oo and there exists c > 0 such that a, > cb, (resp. dn < —cby) 
for every n => 1, then a, + +00. 

9. Let g be a real number and (a,),>1 be the sequence defined by a, = q”. If 


q > 1, show that a, => +oo. If g < —1, show that (a,),>1 does not converge 
to either +-co or —oo. 


7.2 


10. 
11. 


12. 


14. 


15. 


16. 


17. 
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* Given positive reals a and g, with g < 1, show that a"q”" =i, 
(IMO shortlist) Let (a,),>1 be a sequence of real numbers such that, for every 
m,n € N, we have 


2mn 


OS eae 


Show that the sequence is constant. 


Let (@;)n>1 be the sequence defined by a) = 1,a2 = V1 +1,a, = V1 + De 


ag = Vl+V14+V2,a5 = ¥1+ 14+ V1+4 V2,.... Show that (an)n>1 


is convergent and compute its limit. 


. (Austrian-Polish) Let (a;),>1 be a sequence of positive reals, such that 


Akt2 = fart + Jak, 


for every k => 1. Prove that the sequence converges and compute its limit. 
Let n > | be a given integer and f, f},...,¢, be given real numbers, such that 
to +t) +:++++ t, = 0. Prove that the sequence (a,)x>1, defined by 


ay =toVk +k + 1+: +t,V0ktn 


converges to 0. 

(Romania) The sequence (x,)n>1 is such that /%,41 +2 < x, < 2, for every 
n => 1. Find all possible values of xj9g¢. 

(Leningrad) Let (a,),>1 be a sequence of real numbers such that 


1 


’ 
m+n 


lam + ay — Am-+nl s 


for all m,n € N. Prove that the sequence is an AP. 
(Bulgaria) For each n € N, let 


De Ca ee 
On = ott \ 7 2 ae 


Prove that the sequence (a,),>2 is decreasing and convergent, and compute its 
limit. 


. (Romania) Let k be a fixed natural number and (a,,),>1 be the sequence defined 


by 


dy = Vk + Vkte t+ Vk, 


with exactly n square roots. 
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(a) Show that (a,),>1 is convergent. 
(b) Show that, if k is odd, then the limit of the sequence is an irrational number. 
(c) Find all natural values of k for which sequence converges to an integer. 


19. For each positive real a, let the sequence (a,,),>; be defined by a, = 1 and 


1 a 
da —— we a — a 
l= 5 aan 


for every integer k > 1. Prove that the sequence converges to ./a. 
20. (TT) The set of natural numbers is partitioned into m disjoint, infinite and 


nonconstant arithmetic progressions, of common ratios dj, do, ..., dm. Prove 
that 

: + : tee ele : 1 

di do din 


21. (OIMU) Let c and @ be positive real constantst and Q be a square in the plane. 
Prove that there doesn’t exist a surjection f : [0, 1]  Q for which 


d(f(x),f()) < clx—ylet'/? 


for all 0 < x,y < 1, where we let d(A, B) = AB denote the euclidean distance 
between the points A and B in the plane. 

22. (Turkey) Let (a,),>1 be a sequence of integers such that 0 < dn41—dn < Gn, 
for every natural n. Given real numbers x and y, with 0 < x < y < 1, prove that 
there exist natural numbers m and n such that 

am 


ye ey 
an 


23. (IMO shortlist) Let (a,),>1 be a sequence of positive reals. Show that 
l+a,> Gen 


for infinitely many values of n. 


“Powers of a positive basis with real exponents will be defined in Sect. 10.7. For the time being, 
you may assume that @ is a positive rational, if you will. 
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7.3 Kronecker’s Lemma 


In this section, we apply some of the ideas exposed so far in this chapter to study 
the important concept of dense set, as well as to present an interesting geometric 
application of it. We start by recalling the following definition. 


Definition 7.29 Given an interval J, we say that a subset X of J is dense (in J) if, 
for every a € I ande > 0, it happens that 


XN (a-—€,ate) #9. 


Intuitively, the density of X in J means that X is spreaded all over J. Problem 4, 
page 206, shows that both Q and R\Q are dense in R, whereas Problem 5, page 206, 
shows that the set of dyadic rationals, i.e., rational numbers of the form aes where 
n,k € Z4 are such that 0 < n < 2*, is dense in [0, 1]. 

A quite useful result on the density (in R) of certain of its subsets is the content of 
Theorem 7.31 and Corollary 7.32, which are collectively known in the mathematical 
literature as Kronecker’s lemma.° The proofs we present, albeit not being the 
simplest ones, have the advantage of deriving from a circle of ideas which are 
interesting in themselves. First of all, we need yet another definition. 


Definition 7.30 A nonempty subset G of R is said to be an additive subgroup of 
R if, for all x, y € G, we havex—y eG. 


Evidently, {0}, Z, Q and R itself are additive subgroups of R. For a less obvious 
example, given real numbers x),...,x,, it is immediate to verify (see Problem 1) 
that the set 


Gy, x, = {ax +-++ + axes a,..., a € Z} (7.1) 
is also an additive subgroup of R. 

Now, let G be an arbitrary additive subgroup of R and take x € G. By the above 
definition, we have 0 = x—x € G. Thus, for x, y € G, we also have —y = O-—y EG 
and, hence, x + y = x — (—y) € G; therefore, G is closed under the operation of 
addition. 

Hence, if a € G, then 2a = a +a € G; moreover, if ka € G, for some k € N, 
then (k + l)a = ka + a € G, so that ma € G, for every m € N. Since Oa = 0 EG 
and (—k)a = —ka € G for every k € N, it follows that 


Ga = {ma; me Z} CG. (7.2) 


The coming result collects two central facts on additive subgroups of R. 


> After Leopold Kronecker, German mathematician of the XIX century. 
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Theorem 7.31 (Kronecker) Let G 4 {0} be an additive subgroup of R, and let 
Gi =GNR}. 


(a) Ifinf(G*_) = 0, then G is dense inR. 
(b) Ifinf(G*_) = a > 0, thena € Gand G = Gy. 


Proof 


(a) Suppose inf(G7_) = 0, leta € R and € > 0 be given. We have to show that 


(b 


wm 


GN (a—€,a+e) £4 @. Sincex € GS —x€ G, it suffices to analyse the case 
a> 0.Ifa—e < 0, we have 0 € GN (a —€,a + €) and there is nothing to do. 
Suppose, then, that a—€ > 0. 

The hypothesis inf(G7_) = 0 guarantees the existence of x € G7 such that 
x < 2e. Letting m be the greatest nonnegative integer such that mx < a —€, we 
claim that (m + 1)x € GNM (a—<€,a-+ €). Indeed, if (m+ 1)x > a+e, we 
would have 


mx<a-—€<ate<(m+1)x, 
so that 
x= (m+ 1l)x—mx > (a+ €) — (a-—€) = 2¢, 


thus contradicting the choice of x. Hence, (m + 1)x € (a—€,a+e€)NG. 
Suppose inf(G7_) = a > 0. We initially claim that a € G4. By the sake 
of arriving at a contradiction, suppose that a ¢ G%.. Then, the definition of 
infimum of a set would assure the existence of elements 6B, y € G%. such that 
a < B < y < 2a. However, since G is an additive subgroup of R, it would 
follow from here that y — B € G*_, with 


0<y—B <2a-a=a. 


This contradicts the fact that « = inf(G7_), thus showing that w € G7. 

Now, take any x € G4 and let g = |= | andr = x—a |=]. so that g € Zi, 
0<r<aandx=gqa+r.Ifr> 0, then the fact that G is an additive subgroup 
of R would imply r = x — ga € G*_, with 0 < r < a. Since this contradicts the 
fact that « = inf(G7_), we conclude that r = 0 and, hence, x = ga € Gy. 

Therefore, 


Gi. C {na; n € N} 


and, since the opposite inclusion was already established in (7.2), we actually 
have G7. = {na; n € N}. Finally, since G = G4_ U {0} U G*, where Ge = 
{—x; x € G7}, it is immediate to see that 


G = {ma; me Z} = Gy. 


In the coming corollary, we stick to the notation set forth in (7.1). 


7.3 Kronecker’s Lemma 223 


Corollary 7.32 (Kronecker) Jf a is an irrational number, then the additive 
subgroup Gig = {m+ na; m,n € Z} of R is dense in R. 


Proof By the sake of simplicity of notation, let G = G.. By the previous theorem, 
in order to prove that G is dense in R, it suffices to prove that inf(G*,) = 0. 

If this was not the case, then, once more from Kronecker’s theorem, there would 
exist a positive real number # such that inf(G7_) = B > 0 and G = Gg. Since both 
a and 1 + a@ belong to G, there would exist distinct, nonzero integers m and n for 
which 


a=np and 1+a=m. 


Now, since @ is irrational, the first equality above would give n # 0 and 
B = £ ¢ Q. On the other hand, we would also have 


(m—n)B =(lt+a)—a=1, 


so that B = + € Q. 


m—-n 
We have then reached a contradiction, which came from the supposition that G 
is not dense in R. Therefore, G is indeed dense in R, as we wished to show. oO 


Our next corollary refines the conclusion of the previous one. 


Corollary 7.33 If a is an irrational number, then the following sets are dense 
in R: 


(a) A= {m+na; m,n € Z and m <0 <n}. 
(b) B= {m+na; m,n € Z and n <0 < m}. 


Proof Without any loss of generality, we can suppose that a > 0. Let’s prove item 
(a), the proof of item (b) being totally analogous. Given a € R and € > 0, we want 
to establish the existence of x € A such thata—e < x < a+. Suppose that 
a—e > 0 (the remaining cases are entirely analogous), and let 6 = min{a@, 2e} > 0. 

We first claim that there exist m,n € Z such that m < 0 < nandm-+na € 
AN (0,4). By contradiction, suppose that 


m+naeANn(0,6)>n<0. 
Choose (by the former corollary) x9 = mo + now € AN (0,6), with the greatest 
possible ng < 0. Since Kronecker’s lemma guarantees that AM (0, xo) is infinite, we 
can take x} = m, + nia €AN (0,x0), with ny < no. Then, 
x9 — X1 = (Mp — m) + (m9 —N)a EAN (0,x9) CAN (0,5), 
which is a contradiction, for, 9 —n, > 0. Hence, we can choose m+na € AN (0, 4), 


with n > 0. This being said, if it were m > 0, we would have m+ na > a > 6, 
which is another contradiction. Therefore, m < 0 and, thus, AN (0,6) 4 @. 
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Now, take x € AM (0,6) and consider all numbers of the form kx, with k € Z,. 
Letting kp be the greatest nonnegative integer for which kyx < a — €, we claim that 
(ko + 1)x € (a—€,a + ¢€). Indeed, if it were (ky) + 1)x > a+ €, we would have 


kox <a—€ <at+e< (ko + 1)x 
and, hence, 
6 > x = (ko + Ix—kox = (a+ €) —(a—€) = 2. 


This contradicts the choice of 6. oO 


The discussion of the following example uses a few simple facts on plane 
Euclidean Geometry, for which we refer the reader to [4]. 


Example 7.34 (Brazil) Let TI be an euclidean plane and f : TI — IT be a function 
such that 


d(P,Q) = 1=> d(f(P),f(Q)) = 1, 


for all P, Q € II. Prove that f is an isometry of IT, i.e., prove that, for all P,Q € TI, 
one has d(P,Q) = d(f(P),f(Q)), where d(X, Y) = XY stands for the euclidean 
distance between the points X and Y. 


Proof For P € II, we let f(P) be systematically denoted by P’, so that d(P,Q) = 
PQ and d(f(P),f(Q)) = P’Q’. Firstly, let’s show that f must preserve segments of 
length /3. 


Claim 1 


PO = V3 > PO = V3. 


Indeed, given points P and Q in the plane for which PO = J/3, let’s construct 
points R and S such that both QRS and PRS are equilateral triangles whose side 
lengths are equal to 1| (cf. Fig. 7.1). 


Fig. 7.1 PO=/¥3> 
PO =W3 
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Fig. 7.2 PO=2=> S Tv 
PO =2 


P R Q 


Let us counterclockwise rotate rhombus PRQS with center at P, until we get a 
rhombus PTUV such that OU = 1. 

Observe that the images P’, R’ and S’ of P, R and S form an equilateral triangle of 
side lengths equal to 1. Since Q’R’ = Q’S’ = 1, it follows that Q’ = P’ or P’R’Q'S’ 
is a rhombus congruent to PRQS (so that P’Q’ = /3). In order to discard the first 
possibility, it suffices to note that, if it were P’ = Q’, then 7’, U’ and V’ would all be 
points on a circle centered at P’ = Q’, while being vertices of an equilateral triangle 
of side lengths equal to 1, which is an absurd. 


Claim 2 For every positive integer n, we have 


PO =n=> PO =n. 


It suffices to establish the case n = 2, the general case being totally analogous. 
Let P and Q be such that PO = 2, and let R be the midpoint of PQ, such that 
PR = RO = | (cf. Fig.7.2). Let’s consider points § and T such that PRS, RST 
and QRT are equilateral triangles of side lengths equal to 1, all situated on one of 


the half-planes determined by line PQ. Using claim | twice, it’s immediate that 
P'QO' =2. 
Analogously, we can prove that 


PO = nvV3 > PO = nv. 


Claim 3 P’Q’ > PO, for all points P and Q in the plane. 


In order to prove this claim, let PO = 1, such that / is neither a natural number 
nor a real number of the form nvV3, for some n € N. By Corollary 7.33 (see, 
also, Problem 3), we can take sequences (m,)z>1 € (nz)x>1 Of integers satisfying 
the following conditions: 


i. mg < 0 < m, for every k > 1; 
ii, Limp+oo(me + meV3) = 1; 
iii. max{0,/—1} < my + nV3 <1, for every k > 1. 


Let’s first show that there exists a triangle of side lengths 7, —m, and nz V3. To this 
end, since my +nzV/3 < 1, we have] + (—m,) > n/3; also, 1 + (m, + nx J/3) > 0, 
which implies / + mnv3 > —m,; finally, from 7 — 1 < my + nwJ/3 we get 


4v3 + (—m,) > nv3 + mt+1>1. 
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R —™Mk 
neV3 —™Mk f Q’ 
nevV3 
l 
P i Q ‘Pt 
Fig. 7.3. P’Q’ > PO 
Fig. 7.4 PO =Il> xX 


PO =! 


Therefore, triangle inequality assures the existence of a point R € IT \ PQ such that 
PR= nV3 and RO = —mkx; since we already have PO = I, there is nothing left 
to do. 

It follows from what we did above (cf. Fig. 7.3) that P’Q’ + R’Q’ > P'R’ or, 
which is the same, P’O’ > P’R’ — R’O’. However, since P’R’ = ny V3 and R’O! = 
—m,, we get P'O! > ngV/3 + mg. On the other hand, since ng/3 + mz ea 1, it 
comes that PO’ > 1= PO. 

Let’s consider again points P and Q in the plane, with PO = 1. Tesselate the 
plane with equilateral triangles of side lengths all equal to 1, such that one of these 


—_ 

triangles has one of its vertices at P and one of its sides on line PQ (cf. Fig. 7.4). 

By what we did above, the images by f of the vertices of such a triangulation form 
the vertices of an analogous triangulation. On the other hand, if X is an arbitrary 
vertex of the original triangulation (see Fig. 7.4 once more), then X’Q’ > XO, for 
every point Q of the plane. Geometrically, this means that Q’ doesn’t belong to the 
interior of the disk centered at X’ and having radius XO. However, since this is true 
for every vertex of the triangulation, we must necessarily have P’Q’ = I. oO 


Apart from the following problems, other interesting applications of Kronecker’s 
lemma will appear in the context of continuity in Problem 14, page 264, as well 
as in Sect. 10.8 (cf. Examples 10.59 and 10.60), when we have at our disposal the 
concepts and elementary properties of logarithms. 
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Problems: Section 7.3 


1. * Given real numbers x), ..., xx, verify that the set 


ss xp = {Gix1 + +++ + aexk; 1, ..., ae € Z} 
is indeed an additive subgroup of R. 
For the next problem, the reader might want to recall the definitions of integer 
part and fractional part of areal number, given in Problems 9 and 10, page 152. 
2. The purpose of this problem is to give a direct proof of Corollary 7.32. To this 
end, givena € R\ Qa eé Rande > 0, start by choosing p € N such that 
- < 2e€ and do the following items: 


(a) Show that at least two of the numbers {a}, {2a}, ..., {(p + la} belong to 

a single interval of the form [ s a) for some integer k satisfying 0 < k < 
D. 

(b) Use the result of (a) to show that there exist m’,n’ € Z such that 0 < 
m' +n'a < . 

(c) Use the result of (b) to show that there exists r € Z such that r(m’ + n'a) € 
(a—€,a+e). 


3. * Givena,/ € R, with @ irrational, show that there exist sequences (m,)x>1 and 
(nx)x>1 Of integers satisfying the following conditions: 


(a) mg < 0 < ng, for every k => 1; 
(b) limy+oo(mp + ma) = I. 


For the next problem, we assume from the reader some familiarity with the 
basics of plane analytic geometry and vector algebra in the plane. We refer to 
Chaps. 6 and 8 of [4] for the necessary background. 

4. Asubset X of an euclidean plane IT is said to be dense in z provided X intersects 
every disk of II. Now, let O be a fixed point in IT. A nonempty subset X of IT is 
said to be an additive subgroup of II with respect to O provided the following 


condition is satisfied: for every A, B € X,if OA — OB = OC, thenC € X. 
(a) IfA,,...,A, € TI and 


k=1 


show that X4, 
not dense in it. 


(b) Choose A and B in IT such thatO ¢ AB. Ifa € R \ Qand 


A, 18 an additive subgroup of IT with respect to O, which is 


Yap = {mOA + (n + pa)OB: m,n,p € Z}, 
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show that Y4 2 is an additive subgroup of IT with respect to O, which is not 
dense in it. 

(c) Give an example of an additive subgroup X of I with respect to O, different 
from IT itself and dense in it. 

(d) Let X be an additive subgroup of IT with respect to O. Do the following 
items: 


i. Given a line r through O, look at it as a real line, with O representing 0. 
Prove that X M r is either the empty set or an additive subgroup of r. 
ii. If there exist distinct lines r and s through O such that KN r,XNs 4 @, 
prove that X is dense in IT. 
iii. If X is not dense in IT and is not contained in a single line, prove that 
there exists a direction d in IT such that X is contained in the union of a 
family of equally spaced lines, all parallel to d. 


Although the next problem does not really use Kronecker’s lemma, this is the 
best place to put it. 

. Leta and £ be positive irrationals such that + + 3 = |. Our purpose is to show 
that the sets 


{|ka]; ke N} and {|kB]; ke N} 


form a partition of the natural numbers. To this end, do the following items: 

(a) Show that if a > 1 is irrational and n € N, then n is a term of the sequence 
([kor|)jo1 if and only if {2} 1-2) 

(b) Given n € N, show that {2\ ee {3} =e 

(c) Conclude that either {2| >1- + or {31 >1- 2 but not both. 

(d) Finish the proof. 


In the notations above, one says that (a,)x>1 and (bg)x>1, given by a, = [ka] 
and by = [kB |, are the Beatty sequences® corresponding to a and B. 
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Let (an)n>1 be a sequence of real numbers. By the series 


® After Samuel Beatty, Canadian mathematician of the XX century. 
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or simply eae dn, We mean the sequence (S,,)n>1, Where Ss, = a, +a2+--:+a, for 
n> 1. The real number s, is called the n-th partial sum of the series et dy, and 
we say that such a series converges to s € R if the sequence (s,)n>1 of ‘its partial 
sums converges to s. In this case, we say that s is the sum of the series and write 


> dn = 5. (7.3) 


n>1 


In other words, whenever we write }°,., ax = s, we will be saying that the terms 
of the sequence of finite sums s, = a, + da) +--+ + a, come closer and closer to 
the real number s, as long as n —> +00. It is in this sense that equality (7.3) must be 
thought of, as a limit. 

We shall sometimes have a sequence (d,)n>0 Of reals, in which case the 
corresponding series will be denoted by }°., dn. We leave to the reader the 
(immediate) task of adapting the former and coming discussions to such a situation. 

Our main interest in this section is to find out efficient criteria to decide whether 
a given series converges or not. If it doesn’t converge, we shall say that it is a 
divergent series. Let’s see two simple examples of divergent series. 


Example 7.35 The series )°,., k and), ;(—1)* diverge. 


Proof The first series diverges, for its n-th partial sum is s, = 1+2+:-:-+n= 


aoe) so that (s,)n>1 is a divergent sequence. In the second case, the n-th partial 
sum S,, of the given series is such that s, = 0 ifm is even and s, = —1 ifn is odd, so 
that (s,,)n>1 is also a divergent sequence. oO 


Given a series }°,.| dn, we refer to a generic term a, as the general term of the 
series. The following proposition gives a necessary condition on the general term of 
a series, if it is to converge. 


Proposition 7.36 If the series )°,., a; converges, then ag iG), 


Proof Given € > 0, we want to prove that there exists no € N such thatn > no > 
|a,| < €. To this end, let 7 = }°,., ag. By the definition of convergence for series, 
there exists np € N such that 


€ 
nz no = |(a1 + a2 +--+ an) — i] < 5. 


Therefore, it follows from the triangle inequality that, form > no, we have 


ldn| < (a, +a. +-+-+a,) —1] + |l— (ay +a) +-++ + ay-1)| 
se a 
~+- =e. 
= 2, 2 
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The converse to the above proposition is not true, namely, there are divergent 
series )°,. , ae for which a, —". 0. The classical example is that of the harmonic 


series, ic., the series )>,., i whose divergence is established in the coming 
example and will find further use in these notes. 


Example 7.37 Given n € N, let m be the only natural number such that 2” <n < 
2"+!_ Then, 


een es | (7.4) 
= Std 


In particular, the harmonic series diverges. 


Proof Note that, for every integer k > 1, 


1 di 1 eee 1 1 it 1 Watt 1 1 
a Lae: eit rs rear i nee ‘oleae —— 
Qk-1 4 J 2k-1 4.2 Qk Dk Dk 2k 2. 

ee 
2k—1 times 
Hence, 
n gm 
1 1 1 1 
1+-+ ->14+=++ = 
2 D5 = a em 
j=3 j=3 
fit.“ j 1 
=1l1+—-+ 
05 Pia =) 


oO 


In what comes next, we shall show that, for every rational r > 1, the series 
ree r converges. To this end, we need to examine the convergence of a geometric 
series, i.e., of a series of the form 


ya 


k>1 


for a certain nonzero real number gq. In this sense, we have the following important 
result. 


Proposition 7.38 Given q € R \ {0}, the geometric series )),5, gq‘ converges if 
and only if 0 < |q| < 1. Moreover, if this is so, then 


1 


vig =—. 


k>1 1-q 
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Proof If |q| = 1, the geometric series diverges, since its general term g‘~! doesn’t 
converge to 0. Suppose, then, that 0 < |g] < 1, and lets, =1+q+-:-+q"! be 
the n-th partial sum of the series. By the formula for the sum of the terms of a finite 
GP, we have 


Sn = = 


Hence, in order to show that the series converges to = it suffices to show that 

q" — 0. But this was done in Example 7.12. Oo 
We can now discuss the promised example. 

Example 7.39 If r > 1 is rational, then the series )~ 1 t converges.’ 


Proof By the Bolzano-Weierstrass theorem, it suffices to show that the sequence 
(Sn)n>1 Of the partial sums s, = yy r is bounded. To this end, givenn € N, take 


m € N such that 2” > n. Then, 


1 1 1 1 
Fae | — — ee — sae =— 
ieee (5 * =) ss (aay Ore x) 


l 2 1 4 1 gn-l 1 
< + . ia + . Vv + eee + . 50=Dr 
1 : AGH: 
<I+ 3-1 + qt a 2(m—-1)(r7-D) 
1 
<)o 2-Dk- 
k>0 


However, since r > 1, we have 0 < sor < 1, and it follows from the previous 


proposition that 


) .. & 
Do ene ae 


k>0 


r—1 
Therefore, we conclude that 0 < s, < = for every n € N, so that the sequence 


(Sn)n>1 1s, indeed, bounded. oO 
Remark 7.40 For the sake of curiosity, we inform the reader that }°;,.; a = f(r), 
where £ : (1,+00) — R stands for the famous Riemann’s zeta function.® 


7Had powers k”, with r > 0 real, been defined (this will be done in Sect. 10.7), the reasoning 
presented in the proof would work equally well to show that the series Det a converges. 

8 After Bernhard Riemann, German mathematician of the XIX century. For more on Riemann, see 
the footnote at page 354. 
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An elementary computation of ¢(2) = x will be hinted to at Problem 12, page 470. 
We also refer to Chap.9 of [5]. Note, however, that for an odd natural number 
m > 1 the computation of the exact numerical value of €(m) is an open problem in 
Mathematics. 


We finish our initial list of examples of series with an additional application of 
Bolzano-Weierstrass theorem to the convergence of series. In the coming example, 
we introduce one of the most important constants of Mathematics, the number e, 
which will play a major role in Sect. 10.7. 


Example 7.41 The series )>,. a converges to an irrational number e, such that 
2 < e < 3. Insymbols, 


e=) —. (7.5) 


Proof Let (s,)n>0 be the sequence of the partial sums of the given series, i.e., 


= 1 1 1 
Sn = tata tet 
For this sequence, we clearly have 1 = sg < 5s; < sy < +++; on the other hand, since 


k! > 2‘! for every integer k > 2, we have, for an integer n > 4, 


<4 or ree | 8 1 35 
n= Se et gs ge = 
k=0 k=4 k>4 
where we used the formula for the sum of a geometric series in the last equality 
above. Hence, the sequence (s,),>0 is increasing and bounded, thus, convergent. 
Now, item (b) of Proposition 7.15, together with s3 = 2 ands, < = for every 
integer n > 4, gives 2 < e < 3 (for an approximation of e with five correct decimal 
places, see Problem 8). 
We shall now show that e is irrational.’ To this end, observe initially that, for 
natural numbers 1 < n < m, we have 


m 


1 1 
Sm — Sn = > as pS kt 


k=n+1 ~ k>n+1 


1 1 1 
=aem (ta taenerst) 


1 1 1 2. 
ee (7.6) 
(4-1) (n+2)k (n+1)! n+1 


°For the reader’s knowledge, we observe that the number e is, indeed, transcendent, 1.e., it is not 
the root of a polynomial with rational coefficients. A proof of this fact is beyond the scope of these 
notes, and can be found in [11]. 
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where, once more, we used the formula for the sum of a geometric series in the last 


equality above. 
Therefore, yet for natural numbers 1 < n < m, it follows from the above 


computations that 


“1 1 n+2 
m — Sn = < Sn 7. AME ae a 
mast DO Tae wel 
k=n+1 
and item (a) of Proposition 7.15 gives 
~ i Jee 1 n+2 
Gee Pe (n+ 1)! n+1 
Thus, 
1 n+2 
Sn < e < Sn 


Th, sae. 
Multiplying this last inequality by (nm — 1)! and noticing that sy = s,—) + , we 
conclude that 


n+2 


1 1 
— 1)!s,_ — <(n—l)!le < (n—1)!s,_ — + ————_., 
(n—1)!s i (n— 1)!le < (n—1)!s a Ces: 


Writing ¢, = n!s, € N and observing that 


1 2 
—— + —— 
(n+ 1)? n(n+ 1) 


1 n+2 1 
n 
1 1 2 21 
3 


a Mae? 7 


+ 


= = = — <1, 
+getz ag = 3 


for every integer n > 2, we finally arrive at the estimate 
tr < (n-—Dle<t-14+1, 
which is valid for every integer n > 2. 
Now, suppose that e = a with p,q € N. Making n = q+ 1 > 2 in the above 
inequalities, we would get 


ty <(q—l!p < ta +1, 


with t, € N. This is a contradiction. oO 


As an alternative to (7.5), we have the following result, which will be quite useful 
in Sect. 10.7. 
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Theorem 7.42 ¢ = lim,++00 (1 + 4)". 
Proof Let a, = (1 + 1)”, Arguing as in the proof of Example 7.23, we get 


n n 1 
eae S GE 
k=2 


1 n(n—1)(n—2)...(n—k +1) 


m24 04. nk 


k=2 


—2* 


The above computations also give 


aa | 1 2 k-1 


n+1 
2 k-1 
2 f= (1- gai 
‘ > —)( —) ( —) a 


Therefore, (a,),>1 is monotone increasing and bounded above, and hence there 
exists ] = limy—++00 dy. In particular, since a, < e for every n € N, item (b) of 


Proposition 7.15 gives / < e. 
Also from the computations above, given natural numbers n > m > 2 we can 


write 
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Therefore, it follows from Problem |, page 218, we have 


1 2 m—1 aa 
l= li n> ii f= 1 = 5) (1= Oa So 
Pru a tim ( -) ( =} ( n ) ( d 7 


However, since m € N was arbitrarily chosen, we conclude that / > oe 


for every m € N. Then, letting m — +00 and invoking again item (b) of 
Proposition 7.15, we finally obtain 


oO 


Back to the general development of the theory, the next result is the analogue, for 
series, of Proposition 7.18, and teaches us how to operate with convergent series. 


Proposition 7.43 If )°,., ax and >>, by are convergent series and c is a real 
number, then: 


(a) doge1 Ca converges and Dy) Cay = Coys A 
(b) ee (ax + b,) converges and Dee (ak + by) = ae art ei by. 


Proof 


(a) If s, denotes the n-th partial sum of the series }°,., ax, then the n-th partial 
sum of the series ps1 Ck equals cs,. Hence, according to item (a) of 
Proposition 7.18, )>,. , cag converges, and 


) ca, = lim cs,=c lim > Ak. 
+ 


n—>-+o0o0 n—>+0o0 
k>1 k>1 


(b) If s, and ft, are the n-th partial sums of the series }°,., a, and >°,., be, 
respectively, then the n-th partial sum of the series }°,. , (az + by) equals 5, + tn. 
Therefore, item (b) of Proposition 7.18 assures the convergence of this last 
series, with 


bh) = ntt)= ii n lim t, = Dx. 
re ae re re ee 
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A quick analysis of the arguments presented in Examples 7.39 and 7.41 gives the 
following more general result, known as the comparison test for the convergence 
of series. 


Proposition 7.44 Let (ax)x>1 and (bx)k>1 be sequences of positive real numbers, 
such that a, < b,x for every k = 1. If the series Dit by converges, then so does the 
series ad a,x. Moreover, 


Proof Letting s, = )-y—, ae and t, = >, by, it follows from 0 < ay < dy 
that 0 < s, < t,, for every n € N. Since the sequence (¢,),>1 converges, it is 
bounded. Hence, the sequence (s,,),>1 is monotonic and bounded, thus convergent, 
by Bolzano-Weierstrass theorem. To what is left to prove, it suffices to make n > 
+oo in the inequality s, < ft, and apply the result of Problem 1, page 218. oO 


Example 7.45 Is there a sequence (a,),>1 of positive real numbers such that both 
series )0,., ax and )>,., Pa converge? 


Solution Suppose there is such a series. Then, item (b) of Proposition 7.43, together 
with the inequality between the arithmetic and geometric means, would give us 


1 1 2 
a+ >a Pa me ae aa = ~ . 


k>1 k>1 


Therefore, by the comparison test for series, the harmonic series would be conver- 
gent, which is an absurd. oO 


The coming example uses the theory of series to give a proof of the uncountabil- 
ity of R. 


Example 7.46 Problem 23, 172, shows that the family ¥ of infinite subsets of N is 
uncountable. On the other hand, if A = {m, < m2 < m3 < ---} is sucha set, — 
the comparison test, together with the ee of the geometric series )* jl oT : 
guarantees the convergence of the series }°,., or: 

Let B = {n, < nz < nz < ---} be another infinite subset of N. If we show that 


~~ Jin 7 =o Qn? 


k>1 


then the correspondence A > >> al aa defines an injection from ¥ into R, and this 
guarantees that R is uncountable. (Otherwise, by composing such a function with a 
bijection from R to N, we would get an injection from ¥ to N, thus contradicting 
the uncountability of F.) 


7.4 Series of Real Numbers 237 


What is left to do is quite similar to the proof of Example 4.12. Indeed, suppose 
we had 


ee 
k>1 2m k>1 2m 
Then, 
1 1 1 1 1 
amy Qin = be an = a = Qni-1 
> k>1 j=n 


so that m; => n,. By reversing the roles of the two series, we analogously conclude 
that m, <n,, so that m; = n,. Thus, 


1 1 
2 2m = On ? 


k>2 k>2 


and a similar reasoning gives mz = nyo. Finally, by continuing this way, we get 
my = nx for every k => 1, so that A = B. 


Back to the development of the theory, for a series }°,., ax with infinitely 
many positive and negative terms the results obtained so far say nothing about its 
convergence. We remedy this situation from now on, starting from the following 


Definition 7.47 A series }°,.. ; ax is said to be absolutely convergent provided the 
series )°,. |ax| converges. 


The usefulness of the concept of absolutely convergent series stems from the 
coming proposition, as well as the subsequent example. 


Proposition 7.48 Every absolutely convergent series is convergent. 


Proof Let )>,., a be an absolutely convergent series and, for each integer n > 1, 
let s, = a, tay +---+ a, and t, = |a,| + |a2| +---+ |a,| be the n-th partial sums 
of the series )°,., ax and )°,., |ax|. Given integers m > n > 1, we have 


[Sm — Sy = |dn-+1 + Gn42 tee + An| 


IA 


lant] + |anz2| + +++ + lanl 
= ty— th. 
Since (t,)n>1 converges, it is a Cauchy sequence; hence, given € > 0, there exists 
no € N such that m > n > no => |tn — t,| < €. With these € and no, it follows from 


the above inequality that 


m>n>n=> [Sm — Sn <Stn-th <€, 


238 7 More on Real Numbers 


and (S,)n>1 18 also a Cauchy sequence. Therefore, Theorem 7.27 guarantees the 
convergence of the sequence (s,,),>1, aS we Wished to show. Oo 


The converse to the previous proposition is not valid, namely, there are conver- 
gent series which are not absolutely convergent. The classical example is a direct 
application of the coming result, which is known in the mathematical literature as 
the Leibniz criterion!® for the convergence of alternate series. 


Proposition 7.49 (Leibniz) If (an) n>1 is anonincreasing sequence of positive reals 
such that ay, — 0, then the alternate series Ys (Hl) hax converges. 


Proof Foreachn €N, let s, = aj+a2+---+an. Condition a; > a2 > a3 >--- >0 
easily gives 


S12 83 2 S5 Sr = So = S84 = Sd. (7.7) 
On the other hand, for each m € N we have 
|Sam—1 _ S2m| = dam > 0, 


which clearly guarantees, in conjunction with (7.7), that (S,),>1 is a Cauchy 
sequence. Therefore, (s,),>1 is convergent, as desired. oO 


Example 7.50 The alternate series )~ kel ey converges, by a simple application 
of the former proposition (we shall compute its value in Problem 8, page 484). 
Nevertheless, the series formed by the absolute values of its terms (the harmonic 


series) diverges. 


We now discuss quite a useful criterion for the convergence of series of nonzero 
real numbers. It is based on the asymptotic behavior'' of the quotient of neighboring 
terms of the series, and is known as the ratio test. 


Proposition 7.51 Let (a,),>1 be a sequence of nonzero real numbers, such that 


a — 1. Ifl <1, then the series eet a; is absolutely convergent; if 1 > 1, then 


the series ) >, a is divergent. 


Proof Let’s prove that the series ei a, is absolutely convergent if / < 1 (the 
proof of its divergence in case / > 1 is completely analogous). 


‘0 After Gottfried Wilhelm Leibniz, German mathematician and philosopher of the XVII century. 
Together with Sir Isaac Newton, Leibniz is considered to be one of the creators of the Differential 
and Integral Calculus. Up to this day, we still use some of the notations invented by Leibniz more 
than 300 years ago. 

Tn this context, this expression refers to the behavior of some expression that depends on n € N, 
when n — +00. 
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Letting / < 1, we can take a real number g such that / < g < 1. The convergence 


n . 
Mott —> lassures the existence of ng € N such that 


lan+1| < 
lan 


n=n=> 


Hence, for n > no, we have 


n—-1 


lan+1| - 
lan] = lanl T] < |aniq"—. 
lax 


k=no 


Thus, for n > no, the terms of the series }°,., |ax| are majorized by those of the 
series >>, [dn |g” ™, which converges, by Propositions 7.43 and 7.38. Therefore, 
it follows from the comparison test that x1 |a%| converges, which is the same as 
saying that )°,. , ax is absolutely convergent. Oo 


In the notations of the former proposition, we observe that, if ] = 1, then the 
series opt a; May converge or diverge. Indeed, for a, = 4 we have 


albeit the series )~ 1 i diverges; on the other hand, for a, = + we have 


ak+1 k2 k 
OE ae, 
ak (k+ 1)? 


while the series )7,. | z converges. On the positive side of things, we present the 
following 


Example 7.52 Given a natural number m and a real number g > 1, explain whether 


> -_ m . 
the series }~ we (—-D* oe converges or diverges. 


Solution Letting a, = (—1)""! a forn > 1, we have 


Gn+1 
an 


i" ¢ WV" 1 on 1 
oem £-(4 yore 
gq’ nm n qd qd 


Therefore, by the ration test, the given series is absolutely convergent, hence, 
convergent. oO 


We close this section by discussing the product of two absolutely convergent 
series. 


Theorem 7.53 Let ae a; and pa b; be absolutely convergent series. If 


k-1 
Ch = ~ ajb; = Se aibi-i (7.8) 


i+j=k i=l 
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fork = 1, then ‘Ee cx is absolutely convergent and such that 


Da=(La)(L4). 


k>1 i>1 jel 


Proof It suffices to show that, given €« > 0, there exists no € N such that, forn > no, 
we have 


ra ( 2a)(204) |<e and ra ( Xa)(24)| <€, 


Let’s guarantee the existence of no € N for which the first inequality above is true 
(the analysis of the validity of the second inequality is entirely analogous). 
Given n €N, it follows from triangle inequality that 


ea-(La)(L}] =|L(D4")- (LaLa) 


+j=k i=1 j=l 


“(SN E5)-(Hay(Eo) 


+(L(Ls)-(Lf(Lh 


Let A, B and C denote the first, second and third summands of the right hand side 
above, respectively, so that 


B= a 
i>n 


. Io, and C= oa 
j=l i>1 


(Lal 
jon 


The sequence (4 bi) , being convergent, is bounded; therefore, there 
n>1 


exists M > 0 such that | pa 1 b| < M, for every n => 1. On the other hand, since 


the series ));., a; and )/,., bj converge, we have 


Ya ae va Da F420 


i>n i=1 i=1 i=1 


and, analogously, >~ pal —+ 0. In order to estimate C , we can suppose, without 


loss of generality, that Si a; # 0. Then, we can choose 11, 2 € N such that 


n>n> | oa 
i>n 


€ € 
<— andn>n = |0o| < 
3M . lB yea 
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hence, for n > max{71,n2}, we have 


€ 
B< = -M=~= and C<| a;\- —=———— = -. 
3M dX, aaa, 
In what concerns A, notice firstly that 
2n n n 
A=|P(¥ a)-(La)(La)|=| Lo ad 
k=1 itj=k i=1 j=l max{ij}>n 


i+j<2n 


2n 2n 
Yo laibil < D> Yo lallel+ YS Yo lailldy 


= 
max{ij}>n i=n+1 j<n j=n+l i<n 
i+j<2n 
2n 2n 
<(  lal)(S let) + (Yo il) ail). 
i=n+1 jel j=n+l i>1 


To estimate A, we can suppose that }7;.; |ai| # 0 and >) ,., |b;| 4 0. Now, 
since the series }7;., |a;| and }?,., |bj| converge, the sequences (emer |ail),,., and 


(5 11) are Cauchy; therefore, there exist 13,4 € N such that 
n>1 


2n 2n 
€ € 
n>mn> ) lai| < —————-_ and 2 > > ) |b;| << ——_.. 
Sal 6 >I [jl j=ntl 6 dirt lail 


Then, form > max{n3,n4}, we get 


€ 
6 ys [Dil pa 55, il Dla 


jz i>1 


Finally, by letting n9 = max{n,, 12,73, n4} and taking n > no, all of the previous 
estimates are valid, so that 


| oe -(Soai)(0a)| sat B+C<3-5= 


k=1 i>1 jz1 
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1. Let (dn)n>1 be a sequence of positive real numbers defined by a; = ; and 


Ant = a + dy, for every n € N. Prove that oe <r converges and show 
that 


1 
re 


k>1 


2. Sequence (a,)n>1 is a nonconstant AP of nonzero real numbers. Prove that 


(Apa converges and compute its sum. 
3. Given a real number a > 1, prove that the series )°,. a converges and 
compute its sum. 


4. Decide whether the series }°,., converges or diverges. 


1 
VktVe=1 
: 1 
5. Prove that the series )°;. 1999 Taio converges. 
6. Let (a,)n>1 be an infinite, nonconstant AP of positive terms. Prove that: 
1 . 
(a) dogs z, diverges. 
1 
(b) Yoest ar converges. 


7. (NMC) Let A be a finite set of naturals, all of the form 2°3°5°, for some 
nonnegative integers a, b and c. Prove that 


yo <4 


xEA 


8. * The purpose of this problem is to show that e & 2.71828, with five correct 
decimal places. To this end, do the following items: 


(a) For every integer n > 10, show that 


ee es Sa a eee 
10! 11! n! 10! ll 1? fi20 7” 


(b) Use item (a), together with the fact that 10! > 2- 10°, to show that 0 < 
e—Yieig < 10°. 

(c) Conclude from (b) that 2.71828 approximates e with five correct decimal 
places. 


9. * Given a sequence (a), d2,a3,...) of digits, prove that there exists a single 
x € R such that, for a givenn € N, we have 


TA 


10. 


11. 


12. 


13. 


14. 


15. 


16. 
Vis 


18. 
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for every natural number k > n. In such a case (and as the reader is certainly 
used to), we write x = 0.a,da)a3... and say that 0.a;a2a3... is the decimal 
representation of x. 

Show that every real number x € (0, 1) admits a unique decimal expansion of 
the form x = 0.a,a2a3..., with a, 4 0 for infinitely many values of n. Then, 
use this fact to construct a surjective function f : [0, 1] — [0, 1] x [0, 1]. 

Let (d;)n>1 be a sequence of real numbers such that }°,., ag converges. Prove 
that, for every rational a > 5, Yael gr also converges. 

Let (a,,)n> 1 be a sequence of positive real numbers, such that the series )* po Uk 
converges. Prove that }°,., ./acac+i also converges. 

Let (F;,)n>1 be the Fibonacci sequence, i.e., the sequence defined by F; = 1, 
Fy, = 1 and Fy42 = Fry, + Fy, for every integer k > 1. Show that the series 


Bes i converges. 
Let (@,)n>1 be a sequence of positive reals such that 2/a, —> L. 


(a) If 1 < 1, show that the series }°,. , a, converges. 

(b) If 7 > 1, show that the series pee a; diverges. 

(c) If 7 = 1, give examples showing that the series }7,. , a, may converge or 
diverge. 


The convergence criterion given by the case / < | is known as the root test. 
Let )°,. a be an absolutely convergent series, with }°,., a, = 0. Show that 


ai a2 an-1 
Lente tte) 


n>1 


converges and compute the corresponding sum. 


_4)k 
Prove that e~! = pane co 7 : 


Let )° x>1 4 be an absolutely convergent series and gy : N — N bea bijection. 


(a) If a, = 0 for every n € N, prove that )°,., ak = Yor) do: 

(b) Write a, = at —a,, where at = max{a,,0} and a, = —min{ay, 0}, for 
every n €N. Prove that )°,., ae and >°,., a are both convergent. 

(c) Conclude that, in general, re a= es Ag(k)- 


In view of item (c) above, we say that an absolutely convergent series is 
commutatively convergent. 


The convergence criterion for series stated in the next problem is due to N. 
Abel, and is known in the mathematical literature as Abel’s convergence test 
or Abel’s convergence criterion. 

Let (a,)n>1 and (bn)n>1 be two sequences of real numbers satisfying the 
following conditions: 


(a) The sequence (s,),>1, defined by s, = a; +---+ a, for every n € N, is 
bounded. 
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19. 
20. 


21. 


22. 
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(b) by} > by > b3 > ++» > Oandb, —> 0. 


Prove that the series )~ kel a,b, converges. 
Show that Abel’s criterion implies Leibniz criterion. 
Do the following items: 


(a) Given a,h € R, with h 4 2/n for every 1 € Z, show that 


k sin (a + tout) sin erph 
sin(a + jh) = —— 
j=0 2 
(b) Use Abel’s criterion to show that Vet as converges. 
In a cartesian coordinate system centered at O, let (An)n>1 be the sequence of 
points such that Aj = (1,0) and: 


(i) Triangle OA,A,+1 is rectangle at A, and such that A,A,41 = 1. 
(ii) Triangles OAy+1Ayj+2 and OA,A,+1 have disjoint interiors, for every n > 1. 


Prove that, when n — +00, half-line OA, revolves infinitely many times 
around O. 

Let T,, be a right triangle whose side lengths are 4n”, 4n* — 1 and 4n* + 1, with 
n €N. Let a, be the measure, in radians, of its internal angle opposite to the 
side of length 4n”. Show that 


So ax = >. 


k>1 


Chapter 8 
Continuous Functions 


In this chapter we formalize the concept of continuous function, intuitively thought 
of as that of a function whose graph is a curve without interruptions. As a result of 
our discussion, we shall present sufficient criteria for a function to be continuous 
and, among other important results, shall show that every continuous function 
possesses the intermediate value property. Also, several interesting examples are 
scattered throughout the chapter. 


8.1 The Concept of Continuity 


Let’s initially consider the function f : R > R given by f(x) = {x} (the fractional 
part function), whose graph is sketched in Fig. 8.1. After a quick look at it, we 
would certainly feel comfortable in saying that such a graph is discontinuous, for, 
it presents several (actually, infinitely many) jumps. Also, note that this apparently 
doesn’t happen with the graph of the function g : R > R given by g(x) = x’, which 
should surely be called continuous. 

It does emerge the question of how to find a reasonable criterion to identify 
the existence or absence of jumps in graphs, thus discerning between the two 
possibilities above. In order to develop some intuition on how to do it, let’s restrict 
the domain of the function x > {x} to the interval [F. 3]. If we denote this new 
function still by f, then we easily conclude that 


1 3 
im = [0.2] of2.1) 


in particular, there exist values of y in the closed interval bounded by f (3) and f (3), 
for example y = 2, such that no x € [;. 3] satisfies f(x) = y. 
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Fig. 8.1 Graph of x > {x} 


On the other hand, it is easy to verify that this doesn’t happen with the function g 
of the first paragraph. More precisely, let’s fix an interval [a, b] C R and let g({a, b]) 
denote the image of the restriction of g to [a, b], i.e., 


8([a, b]) = {g(x) € R; x € [a, b]}: 
then, by setting c = max{—a, b}, it’s easy to show that 


la |), if0<a<b 
(a,b) =) [0.c?], ifa<O<b. 
[oa |, ifa<b<0 


In particular, for every y in the closed interval bounded by g(a) and g(b), there exists 
x € [a,b] (x = £,/, according to the case at hand), such that g(x) = y. 

We are now in position to generalize the previous discussion with the following 
definition. To this end, in all that follows, unless we explicitly state otherwise, J 
denotes an interval and X a union of intervals of the real line. 


Definition 8.1 A function f : X — R has the intermediate value property (cf. 
Fig. 8.2) if, for every interval [a, b] C X and every yo situated in the closed interval 
bounded by f(a) and f(b), there exists xo € [a, b] such that y = f(x). 


The previous discussion assures that the function f(x) = {x}, x € R, does not 
have the intermediate value property, while the function g(x) = x’, x € R, has 
such a property. Therefore, it compels us to say that, if a function f : X — R has 
the intermediate value property, then its graph should be continuous, i.e., with no 
jumps. Nevertheless, the reader can easily verify that the function f : [0,2] > R, 
given by 


ifO<x<1 
ifl<x<2’ 


fe=y 4 
A, 
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Fig. 8.3. Continuity from the correct viewpoint 


does satisfy the intermediate value property, albeit its graph presents a jump at 
x = 1. Thus, the intermediate value property is not the correct way to formulate 
the concept of continuous function. 

In order to adequately define the concept of continuous function, let’s analyse the 
whole situation from another point of view. Let f : (a,b) — R be a given function, 
Xo € (a, b) be a fixed real number and Po(%o,f(x0)) € Gy. Also, let x € (a, b) \ {xo} 
and P(x, f(x)) € Gy. For the graph of f to be called continuous at xo, our geometric 
intuition says that, the closer x is of xo, the closer P should be of Po. 

More precisely, this closedness means (cf. Fig. 8.3) that, for a fixed, arbitrarily 
chosen error r > O for the position of the point Po (i.e., given an arbitrary disk 
D(Po;1r), centered at Po and with radius r), we should have P € D whenever the 
abscissa x of P approximates xo within a sufficiently small error, say less than a 
certain 6 > 0 (one reads delta). In symbols, given an arbitrary r > 0, there must 
exist 6 > 0 such that 


|x —x9| <6 > PPo <r. (8.1) 
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Fig. 8.4 Elaborating the correct definition of continuous function 


Since every disk in the cartesian plane contains arbitrarily small rectangles with 
sides parallel to the axis, it is easy to show (cf. Problem 4) that the validity of the 
condition (8.1) is equivalent to the following alternative geometric description (cf. 
Fig. 8.4): given an arbitrary error € > 0 (one reads epsilon) for the value of f at xo, 
ie., given an arbitrary horizontal strip 


R x (fo) — €,f (xo) + €) 


of the cartesian plane, symmetric with respect to the point Po(xo,f(xo)) (the gray 
strip in Fig. 8.4)), there must exist another error 6 > 0 such that, for every x € X 
satisfying |x—xo| < 6, the point P(x, f(x)) belongs to the gray strip. This being said, 
we can finally state the formal definition of continuity of a function (at a point). 


Definition 8.2 A function f : X — R is continuous at the point x) € X if the 
following condition is satisfied: given € > 0, there exists 6 > 0 such that 


xeEX, |x—xo| < 6 > |f(X) —fQ0)| <. (8.2) 


The function f is said to be continuous if it is continuous at every point xp € X. 
Example 8.3 Every constant function is continuous. 


Proof Let c be a given real number and f : R — R be the function constant and 
equal to c. To show that f is continuous at x) € R, the definition of continuity asks 
that, given « > 0, we find 6 > 0 such that, for x € R, the validity of inequality 
|x — xo| < 6 implies that of |f(x) — f(xo)| < ¢. However, since |f(x) — f(%o)| = 
lc —c| = 0, inequality |f(x) — f(xo)| < € is always true, independently of any 
restriction on |x—xo|. Yet in another way, taking 6 equal to any positive real number, 
we will always have that |x — x9| < 6 => |f(~) —f(%o)| < ¢, for, the inequality 
If (x) —f(xo)| < € cannot be false. Oo 
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In order to establish the continuity of other, less simple functions, we need to 
elaborate a little more the former definition, and we do this next. 

Suppose we want to establish the continuity of f : X — R at xo € X. According 
to the given definition, we should assume that an error € > 0 for the value of f(xo) 
is given and, then, be capable of finding another error 6 > 0 for xo that turns true 
implication (8.2). In general, the following strategy is a good one: 


(i) Starting from x € X subjected to an error |x — x9| < 6, we estimate by excess 
the error |f(x) —f(xo)| in terms of 6, obtaining an inequality of the type |f(x) — 
f (x0)| < E(6), where E represents a certain function of 6. 

(ii) Then, we impose that such an error E(6) doesn’t surpass the desired error e€, 
thus finding the appropriate values of 5. Usually, this second step reduces to 
solving, for 5 > 0, inequality E(6) < e. 


Once we execute the two steps above, if 6 > 0 satisfies E(6) < €, we will clearly 
have that 


x € X and |x—xo0| < 6 > |f(@) —fQ0)| < E() <«, 


as we wished to show. 
As a last remark, the coming examples will make it clear that, along the execution 
of items (i) and (ii), the function E will be generally implied. 


Example 8.4 The modular function (cf. Example 6.60) is continuous. 


Proof Given xo € R, if |x — xo| < 6, then triangle inequality gives 
If) —f@o)| = [lx] — [xoll < lx — 20] < 6. 


Hence, if 6 < «, it will follow from the above that |f(x) — f(xo)| < € whenever 
|x = xo| <6. oO 


Example 8.5 Let a and b be real numbers, with a # 0. If f : R > R is given by 
f(x) = ax + Db, then f is continuous. 


Proof Again, for a given xp € R, se |x — xo| < 6, we have 
If) —f@o)| = |(ax + b) — (axo + b)| = lal|x — x0] < als. 


Hence, if |a|é < € (or, equivalently, 6 < rae we have |f(x) — f(xo)| < € whenever 
bs = xo| <6. oO 


Example 8.6 The square root function f : [0, +oo) > R, such that f(x) = ./x for 
x > 0, is continuous. 


Proof If x9 = 0 and 0 < x < 6, then |f(x) —f(xo)| = J/x < V6. Since V8 <€ © 
6 < €?, for0 < 8 < €? we have [f(x) —f(xo)| < € if |x — 0| < 8. 
Suppose, now, that x) > 0. Then, for |x — xo| < 6, we have 


IF) —f@0)| = |V¥— V0] = = 
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Fig. 8.5 | sinx| < |x| 


Note that Ta <¢e< 6 < ,/xoe. Therefore, taking 0 < 5 < ./xoe, we conclude 

that |x — x0| < 6 => |f(x) —f(xo)| < €, as wished. Oo 
In what comes next, we establish the continuity of the sine and cosine functions. 

To this end, we shall need the following auxiliary result. 

Lemma 8.7 For every x € R, we have | sinx| < |x|. 

Proof Firstly, let’s show that sinx < x whenever 0 < x < 4. This inequality is 

obvious for x = 0; for0 < x < 4, mark (cf. Fig. 8.5) point P in ce first quadrant of 


the unit circle such that 0(AP) =X) 


Letting P’ be the symmetric of P with respect to AA’ and @Q the intersection of 


P’P and A’A, we have P’/P = 2 OP and £( PP) = = 20(AP). Since the length of every 
arc of a circle is greater than that of the corresponding chord, we get 


2sinx = PP’ < 0(PP’) = 20(AP) = 2x. 


Now, since ont) = —sinx, it’s immediate that | sinx| < |x| for |x| < 
Finally, for |x| > 4, we have 


1 
|sinx] <1< = < |x|. 


Example 8.8 Sine and cosine functions are continuous. 


Proof Fix xo € R. If |x — xo| < 4, it follows from the sum-to-product identities, 
together with the previous lemma, that 
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_ (X—X0 
|cos.x —~cos.xo| = 2|sin ( 5 )| 
<afon (2) 22] 52 
2 2 


= |x—Xxo| < 6. 


Hence, if 6 < €, then | cosx — cos.xo| < € whenever |x — x| < 6, as wished. 
Finally, the reasoning for the sine function is completely analogous. Oo 


Later (cf. Proposition 8.19), we shall prove that if / is an interval andf, g: 1 — R 
are continuous at xo € J, then the functions f + g,f- g : 7 — R are also continuous 
at xo. On the other hand, we shall show in Proposition 8.21 that, if g doesn’t vanish 
in J, then the function £ 7 — Ris also continuous at xo. For the time being, we 
assume these facts without proof, using them to present some additional examples 
of continuous functions. 


Example 8.9 Given a € R andk €N, the function x ++ ax* (from R to itself) can 
be seen as the product of a constant function (equal to a) and k copies of the identity 
function, x +> x. Therefore, Examples 8.3 and 8.5, together with the discussion at 
the last paragraph (extended to the product of k continuous functions), assure that 
the function x +> ax* is also continuous. 

Applying once more the discussion at the paragraph that precedes this example 
(this time extended to the sum of a finite number of continuous functions), we 
conclude that every polynomial function, i.e., every function f : R — R of the 
form 


F(X) = nx" + ayix" | +++» + ax t ao, 


for some n € N and ao, dj, ..., dy € R, is continuous too. 


Example 8.10 Given a polynomial function f : R — R (see the preceding 
example), we say that xo € R is a root of f if f(xo) = 0. It is possible to prove 
(cf. Chap. 15 of [5], for instance), that every polynomial function has a finite 
(possibly zero) number of real roots. Hence, if f and g are polynomial functions 
and Ry = {x1 < x2 < +++ < x;} is the set of real roots of g, the complement 


R\ Reg = (—00, x1) U (41,.x2) U... U (re—1, x4) U (xe, +00) 


is a finite union of open intervals. This way, we get the (well defined) rational 
function 


I R\R, oR. 
& 


The previous example, together with the discussion at the paragraph that precedes 
it, assures the continuity of rational functions, wherever they are defined. 
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Example 8.11 Function f : R > R, given by 


x* sinx — (x? + 1) cosx 


fa) = x8 + 24+ cosx 


is continuous. Indeed, f is the quotient of the functions g,4 : R — R, such that 
g(x) = x* sinx — (x? + 1) cosx and A(x) = x® + 2 + cosx. The previous examples 
and discussions guarantee the continuity of g and h, and function h doesn’t vanish, 
for x8 > 0 and2+cosx > 0. 


We close this section by examining the continuity of a composition. In this sense, 
the coming result is known as the chain rule for continuous functions. 


Proposition 8.12 [f X,Y C Rare unions of intervals andf : X > Y,g:Y—>R 
are continuous functions, then g o f : X — R is also continuous. 


Proof Let x9 € X and yo = f(xo). Given € > 0, the continuity of g guarantees the 
existence of 6 > 0 such that 


ye, ly—yol < 6 > |g) — a0) <€. (8.3) 


On the other hand, the continuity of f assures the existence of 5’ > 0 such that 


xeX, |x—x|<8' >| 7@) —fO0) | <6. 
—S—=—=—— 


y yo 
Therefore, it follows from the above relations (with y = f(x) in (8.3)) that 


x €X, |x—xo| < 8’ = |f(x) —f(0)| < 8 => [g(F@)) — sf) <€, 


thus establishing the continuity of g o f at xo. oO 


The two coming examples show typical applications of the chain rule for 
continuous functions. 


Example 8.13 Function f : R > R, given by f(x) = sin(x7), is continuous. Indeed, 
if g,h: R > Rare given by g(x) = sinx and h(x) = x’, then g and hare continuous 
and f = g oh; hence, the chain rule assures the continuity of f. 


Example 8.14 Function f : R > R, given by 


x2| sin(x2)| + 1 
SS ————— 
x4 +x? +24 ./| sinx| 


is continuous. Indeed, since f is the composition of the square root function (which 
x?| sin(x2)|-+1 
x42242+/]sina]’ 


the chain rule assures that it suffices to establish the continuity of g. Now, g is the 


is continuous) with the function g : R — R given by g(x) = 
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quotient of the functions x +> x°| sin(x*)| + 1 and x H xt +27 +24 /|sinal, 
which are continuous by the previous examples, together with the chain rule. Finally, 
observe that both the numerator and denominator of g(x) are positive. 


Let us finish this section with a result on the set of points of discontinuity of 
monotonic functions. 


Proposition 8.15 /f 7 C R be an interval and f : I — R is an nondecreasing 
function. then the set of points of discontinuity of f is finite or countably infinite. 


Proof Let 
D = {xo € I; xo is not an endpoint of J and f is discontinuous at xo}. 


For each xo € D, let m,, = sup{f(x); x € I and x < xo} and M,, = inf{f(x); x € 
I and x > xo}. Problem 13 guarantees that m,, < M,,. Now, Problem 4, page 206 
guarantees that we can choose ry, € QN (my), Mxo). On the other hand, if x9 < yo 
are both in D, the fact that f is nondecreasing gives 


Xx 
M,, <f ae) < my, 


so that the intervals (m,,, My.) and (my,,M,,) are disjoint. 
Hence, the correspondence x9 + r,, defines an injective function from D into Q, 
and it follows from Problem 21, page 172, that D is finite or countably infinite. O 


In the notations of the statement of the previous proposition, we call the reader’s 
attention to the fact that Problem 14 will guarantee that every countably infinite 
subset of J is the set of points of discontinuity of a nonincreasing function. 


Problems: Section 8.1 


1. Prove that: 


(a) The function f : R \ {0} > R, given by f(x) = 1, has the intermediate 
value property and is continuous, but the function f : R > R given by 


+, ifx 40 
fo = 1 ifx=0 


doesn’t have such property. 
(b) The function f : R > R, given by 


r+, ifx>0 
—x, ifx <0 


fx) = 


doesn’t have the intermediate value property. 
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. Let 7 C R be an interval and f : J — R be a continuous function. For a fixed 


Xo € I, let g: | > R be the function given by 


c, if x = xo 


B(x) = f(x), if x A x0 - 


Show that g is continuous if and only if c = f(0), i.e., if and only if g =f. 


. In each of the following items, find out whether there exists a real value of c 


that turns the function f : R — R into a continuous one. Justify your answers. 


3x—2, ifx<0 3x-—2, ifx <0 
(a) f(x) = (cc, ifx=0 . (c) fx) =4c, ifx=0 
—2, ifx>0 x7, ifx>0 
cosx, ifx 40 1, ifx 40 
(b) fie) = 17008 HAO @ fy ade ETFO. 
c, ifx =0 c, ifx=0 


. * Prove that every function continuous in the sense of relation (8.1) is 


continuous in the sense of Definition 8.2, and vice-versa. 


. *LetD =R \ {a/2+ ka; k ©€ Z}. Use the discussion at the paragraph that 


precedes Example 8.9 to establish the continuity of the tangent function, 


tan: D—>R 
x Ke > tanx’ 


. If f : X > R is a continuous function, explain why the function |f| : X > R is 


also continuous. 


. Use the results of this section to establish the continuity of the function f : 


[—-1, +00) > R, given by f(x) = aren 


. Letn > 1 be a natural number and f : [0, +co) — R be the n-th root function, 


f() = */x for x => 0. 


(a) Show that f is continuous at x) = 0. 
(b) If x9 > 0 and x > 0, show that 


|x — Xo| 


f(x) —f(xo0)| = Wa + aye be ay 


- |x — xo|. 


< ————— aes 
(x/x0)" 


(c) Use the result of (b) to conclude that f is continuous at xo. 


. Use the results of this section to establish the continuity of the function f : R > 


7 — 3/xt-2341 
R, given by f(x) = 4/ a ee 


8.1 


10. 


11. 


12. 


13. 


14. 
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Justify the continuity of the function f : R — R, given by 


0, ifx=0 
xsint, ifx #07 


f@ = 


* The Dirichlet function! is the function f : [0, 1] > R such that 


_ (0ifx¢Q 
(OS) 


Prove that f is discontinuous at every x € [0, 1]. 
Let f : [0, 1] — R be the function given by 


0, ifx EQ 


m 


f@) = +. if x = 4 with me Z,,n €N and ged(m,n) = 1 

Prove that f is discontinuous at every rational number and continuous at every 
irrational number” of the interval [0, 1]. 

Let J C R be an interval and f : J — R be a nondecreasing function. If x9 € I 
is not an endpoint of /, then 


sup{f(x); x € J and x < xo} < inf{f(x); x € J and x > xo}, 


with equality if and only if f is continuous at xo. 
Let D = {x1,X2,x3,...} be an arbitrary countably infinite set of reals. Let f : 
R — R be given by 


f= s, 


néN; 
XX 


where the sum is interpreted as being equal to 0 if {n € N; x, < x} = 9. Prove 
that: 


(a) f is well defined and nondecreasing. 
(b) The set of points of discontinuity of f is D. 


‘After Gustav Lejeune Dirichlet, German mathematician of the XIX century. Dirichlet made sev- 
eral important contributions to Mathematics, notably to Analysis and Number Theory. Even though 
the most famous of these is perhaps Dirichlet’s theorem for primes in arithmetic progressions 
(about which we shall have more to say on [5]), Dirichlet made very important contributions to 
the theory of Fourier Series, which we will tangentially touch upon in Chap. 11. 


7It is possible to prove that, given an interval J, there doesn’t exist a function f : J > R continuous 
at (every point of) 7M Q and discontinuous at JN (R \ Q). A proof can be found in [1]. 
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In this section we relate the concepts of limit of a convergent sequence and 
continuity of a function. The main result is the coming one, which will be used 
several times in all that follows and assures that continuous functions are exactly 
those that preserve convergent sequences. 

In all that follows, J denotes an interval of the real line. 


Theorem 8.16 A function f : I — R is continuous if and only if the following 
condition is satisfied: for every a € I and every sequence (an)n>1 of elements of I, 
we have 


lim a,=a> lim f(a,) =f(@. 
n—>+oo 


n—>+oo 


Proof Let’s first assume that f is continuous. Then, given a € J and e€ > 0, there 
exists 6 > 0 such that 


xeéel, |x-—al <6 > |f(x~) -f(@| <e. 


Now, let (an)n>1 be a sequence of elements of J converging to a. Then, there exists 
no € N such that 


n> no => |an—al <6. 
Taking together the two conditions above, we get 
n>ng> lan —al <é=> lf (an) —f@| <€, 


which is the same as saying that limy,++00f(an) = f(a). 

Conversely, suppose that f is not continuous at a € J. Then, the definition of 
continuity guarantees the existence of «€ > 0 such that, for every 6 > 0, we have 
If) —f(a|>e for some x5 € T satisfying |x3 —a| < 6. In particular, taking 6 = -, 
with n € N, and writing a, = X1/n, we get 


Jamal <~ and [fla,)—fla)l >. 


Then, the above conditions assure that the sequence (a,),>1 thus constructed 
converges to a, whereas the sequence (f(a,))n>1 doesn’t converge to f(a). Oo 


Let us illustrate the importance of the previous result with two examples, the 
first of which revisits Example 7.19 and shows that the idea used in the proof of 
Example 7.23 is quite natural. 
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Example 8.17 Ifa > 0 anda, = #/a, thena, —> 1. 


Proof We make the proof in the case a > 1, the proof in the case 0 < a < | being 
completely analogous. If a > 1, then a, > a,4, => 1, for every n € N. Hence, the 
sequence (a,,),>+1 is monotonic and bounded, thus convergent. 

Setting 7 = lim,-++4 0 dn, it follows from item (a) of Proposition 7.15 that 
1 > 1. On the other hand, the continuity of the square root function (according 
to Example 8.6) and the previous result guarantee that 


Qn —> 1 > fan —> V1 => aan —> VII, 


where in the last implication we used the fact that ./a, = day. 
Since every subsequence of a convergent sequence converges to the same limit 
n is . . 
of the whole sequence, we have az, —> /. Therefore, the uniqueness of limits of 
convergent sequences gives / = V/J, so that / = 1. Oo 


Example 8.18 (Sweden) Find all continuous functions f : R — R such that f(x) + 
f(x) = 0, for every x € R. 


Solution If f is any function satisfying the stated conditions, we shall show that f 
is identically zero. 

Making x = 0 and x = 1, we get f(0) = 0 and f(1) = 0. Now, for x > 0 and 
applying the stated condition several times, we get 


f@) = FWD =fW9 = f(D =>. 


In particular, |f( %/x)| = [f(x)| for every n € N. 


Letting a, = 7/x, we already know that a, —". 1. On the other hand, since |f| 
is also continuous, Theorem 8.16 guarantees that 


Lf (an)| —> [f()| = 0. 


However, since |f(a,)| = |f(x)| for every natural n, we also have |f(a,)| —s If (x)|. 

Therefore, the uniqueness of the limit of a convergent sequence gives |f(x)| = 0. 
Finally, for x < 0 we have x” > 0. Hence, by what we’ve done above, f(x”) = 0, 

so that f(x) = f(x) + f(x’) = 0. oO 


Theorem 8.16 allows us to easily prove the claims made at the paragraph that 
precedes Example 8.9. We start by establishing the continuity of the sum and 
product of two continuous functions. 


Proposition 8.19 If f,g :I— Rare continuous at xp € I, thenf+¢,f-g:I1>R 
are also continuous at xo. 


Proof Let (dn)n>1 be a sequence in J, such that limp++o0d, = x9. According to 
Theorem 8.16, the continuity of f and g at xo gives 


tm flan) = f(xo) and lim | 8(an) = g(X%o). 
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It thus follows from item (b) of Proposition 7.18 that 


lim (f = 8) (an) = dim (f(an) ae 8(Gn)) 


n—>+oo 


= lim (Gn) = lim g(dn) 
—> +00 


n>+ 


= f (xo) + g(xo) = FF = g) (x0). 


Hence, once more from Theorem 8.16, we conclude that f + g are continuous at x. 
The reasoning for f - g uses item (c) of Proposition 7.18 and is totally similar to 
what we did above, so that it will be left to the reader. oO 


Before we look at the continuity of the quotient of two continuous functions, 
we need an auxiliary result, known as the sign-preserving lemma for continuous 
functions and which is important in itself. 


Lemma 8.20 Let I C R be an interval and f : I > R be a continuous function. If 
Xo € 1 is such that f (xo) > 0 (resp. f (xo) < 0), then there exists 6 > 0 such that 


xéel, |x-—x| <6 >f() > ea (resp. f(x) < 1a 


In particular, f is still positive (resp. negative) in IN (xo — 6, x0 + 4). 


). 


Proof Let’s do the proof in the case f(xo) > 0, the proof in the other case being 
totally analogous. 


The definition of continuity guarantees that, fore = £0) fae > 0, there exists 6 > 0 
such that 
xel, |x—x| <8 > If) —f(00)| <€ = - 
On the other hand, this inequality implies 
fla) — Fo) > 2&2 
or, which is the same, f(x) > feo) , for every x € IM (xp — 6, x9 + 8). Oo 


If g : J > R is continuous at xo € J and such that g(xo) # 0, the sign-preserving 
lemma assures the existence of 6 > 0 such that g doesn’t vanish in the open interval 
J =I1N (x — 46, xo + 5). Hence, by restricting g to J, if necessary, we can suppose, 
without loss of generality, that g doesn’t vanish in J. 


Proposition 8.21 Let f,g:I— R be continuous at xp € I. If g doesn’t vanish in I, 
then f : I > Ris continuous at xo. 


Proof As in the proof of the previous proposition, if (an)n>1 is a sequence in J, such 
that limy— +00 dn = Xo, then 


lim (Gn) = fo) and — lim. g(an) = g(xo). 
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Now, item (e) of Proposition 7.18 gives 


ee 2 . fan) — limps+oof(an) — fo) _ f 
lim —(a,) = lim = = = —(X0). 
n> +00 g nto g(an) ‘lin +400 8(4n) 8%) 8 
Invoking Theorem 8.16 once more, we conclude that f is continuous at xo. oO 


In what comes next, we introduce a stronger notion of continuity, and show that 
it is equivalent to the usual one for functions defined in intervals J = [a, b] C R. 


Definition 8.22 A function f : J > R is uniformly continuous if the following 
condition is satisfied: for every « > 0, there exists 6 > 0 such that 


xyel, |x—y| <b > [f(x) -—fO)| <e. (8.4) 


In words, the difference between the definitions of continuous and uniformly 
continuous functions is that, for a uniformly continuous function and given € > 0, 
the number 6 > 0 whose existence is assured by the definition depends only on 
the chosen error € > 0, being the same for all x,y € I; on the other hand, in the 
definition of continuity of a function at a point xp of its domain, such a 6 depends 
on the error € > O as well as on Xp. 

It is clear from the former definition that every uniformly continuous function 
is, in particular, continuous: it suffices to make y = xo, where xo € J is arbitrarily 
chosen. On the other hand, Problems 4 and 6 bring examples of continuous functions 
which are not uniformly continuous. 

We now pause to give an example of a uniformly continuous function which will 
be useful for later purposes (cf. Example 11.22). To this end, for x € R we let d(x) 
denote the distance of x to the nearest integer, so that 


ifO0< ee 
d(x) = {x}, if OS ty <9 
1—{7),.1 5s i7) <1 
Example 8.23 In the above notations, let a > 0 be given. If x, y € R are such that 
Ix—y| < i, then |d(ax) - d(ay)| < a|x — y|. In particular, x +> d(ax) is uniformly 
continuous. 


Proof If |x—y| < 1, then |ax — ay| < 1. In this case, the integers closest to ax 
and ay, if not equal, are consecutive. Assume this is so, and let n and n + 1 be the 
integers closest to ax and ay, respectively. Then, 


1 
a a a 
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so that 


|d(ax) — d(ay)| = (n+ 1 —ay)- (ax —n)| = |2n + 1 — ay—ax 


=|(o+ 9) + (04 5a) 


1 
< |n+ =—ay|+ 
= hrs ay| 


wi 
n —-—ax 
5 


= lay — ax| =a|x—y]. 


For the last part, given e > 0 take 6 = min{+, <} > 0. Then, 


1 € 
Ix—y| <6 > |x-yl <-> |d(ax) — d(ay)| <a|x—y| <a--=e. 
a a 


oO 


The coming result proves that, for a function defined on a closed and bounded 
interval, the concepts of continuity and uniform continuity coincide. 


Theorem 8.24 Every continuous function f : [a,b] > R is uniformly continuous. 


Proof By contradiction, suppose that f is continuous but (8.4) is not valid. Then, 
there exists « > 0 such that, for every 5 > 0, we can find x, y € [a, b] satisfying 


|x—y| <6 and |f(x) —f(y)| 2 €. 


1 


In particular, choosing 6 = = with n € N, we conclude that there exist elements 


Xn, Yn € [a, b] such that 


1 
Xn — Yn| < 7 and |f(%n) —fOn)| = €. (8.5) 
Since (%,)n>1 is a sequence in [a, b], Weierstrass Theorem 7.25 guarantees the 


existence of a subsequence (Xn, )k>1 Of (Xn)n>1, Converging to some x9 € [a,b]. 
Then, it follows from triangle inequality that 


1 k 
LYng — X01 S [Yang — Xnq| + Xn — X0l < aa a ty = 26 = 
k 


Hence, (yn, )x>1 also converges to x9, and Theorem 8.16, together with the continuity 
of x +> |x|, gives 


iim fn) —fOn)| = Fo) —fo)| = 0. 
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On the other hand, as a particular case of (8.5), we have |f(%n,) —fOQn)| = €; 
and such an inequality obviously contradicts the former limit. Oo 


The previous theorem admits the following important consequence. 
Corollary 8.25 Every continuous function f : [a,b] > R is bounded. 


Proof Since such an f is uniformly continuous, given ¢ = 1 there exists 6 > 0 
such that 


x,y € [a,b], |x—y| < 6 => |f() —f0)| < 1. (8.6) 


Now, choose real numbers a = xo < x1 < +++ < x, = b satisfying x; — xj) < 6 
for 1 <i<k, and let 


M = max{|f(x1)|,.... [fOr |}. (8.7) 


For x € [a,b], there exists 1 < j < k such that x € [xj;-1, xj]; in particular, |x — 
xj-1| < |xj — xj-1| < 6. Then, the triangle inequality, together with (8.6) and (8.7), 
gives 


IF)! SF) — FO) + FDI < 1+ If@)l s1+M. 
Finally, since x € [a,b] as in the last paragraph was arbitrarily chosen, we get 


If(x)| < M + 1 for every x € [a, b], so that f is bounded. Oo 


The last property of continuous functions we shall study in this section is also 
due to Weierstrass, and states that every continuous function f : [a,b] > R assumes 
extreme values in the interval [a, b]. 


Theorem 8.26 (Weierstrass) Jf f : [a,b] — R is continuous, then there exist 
Xm;Xu_ € [a, b] such that 


fm) = min{f(x); x € [a, b]} and f(xm) = max{f(x); x € [a, b]}. 


Proof Let’s prove the existence of x,,, that of xy being totally analogous. Since f is 
bounded, it makes sense to let 


m = inf{f(x); x € [a, b]}. 


The definition of greatest lower bound assures (with the aid of Problem 10, 
page 207) the existence of a sequence (x,),>1 in [a,b] such that f(%,) — m. 
Since every bounded sequence possesses a convergent subsequence, we can take 
a subsequence (X,,)x>1 Of (%n)n>1, Converging to a real number xo € [a, b]. Then, 
Theorem 8.16 gives 


_lim_fG@n) =f). 
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On the other hand, since the sequence (f(,,))x>1 is a subsequence of (f(%;))n>1 
(which, in turn, converges to m), we conclude from Proposition 7.15, that (f(%,,))x>1 
also converges to m. Therefore, the uniqueness of the limit of sequences gives 


Ff (x0) = m. O 


Problems: Section 8.2 


1. Find all continuous’ functions f : R — R such that, for all x, y € IR, we have 
f(x + y) =f) +f). 

2. (Romania) Let f : R — R be a surjective function satisfying the following 
property: for every divergent sequence (a,)n>1, the sequence (f(a;))n>1 is also 
divergent. Prove that f is bijective and that f—! is continuous. 

3. * Given an interval J C R, we say that a function f : J > R is lipschitzian* if 
there exists a constant c > 0 (called the Lipschitz constant of f) such that 


If) —fO)| < clx—yl, 
for all x, y € J. For example, we saw in Example 8.8 that 
|sinx—siny| <|x—y| and |cosx—cosy| < |x—y| 


for all x,y € R, so that both sine and cosine functions are lipschitzian, with 
Lipschitz constant equal to 1. Prove that every lipschitzian function f : J > R 
is uniformly continuous. 

4. For an integer n > 1, show that the function f : R > R, given by f(x) = x”, is 
not uniformly continuous. 

5. Let 7 C R be an interval and f : J — R be a continuous function. Suppose 
that there exist € > 0 and sequences (a,),>1 and (b,)n>1 in J, such that |a, — 
b,| —> 0, but lf (an) —f(bn)| = € for every n > 1. Show that f is not uniformly 
continuous. 

6. Show that the function f : (0,+00) — R, given by f(x) = sin 1, is not 
uniformly continuous. 

7. The purpose of this problem is to present another proof of Theorem 8.26. To 
this end, recall that Corollary 8.25 assures the boundedness of a continuous 
function f : [a,b] — R, so that there exist 


m = inf{f(x); x € [a, b]} and M = sup{f(x); x € [a, b]}. 


3]t can be shown that the hypothesis of continuity is essential here. More precisely, there exists 
infinitely many functions f : R — R such that f(x + y) = f(x) + f() for all x, y € R but f is not 
linear. A proof of this fact is beyond the scope of these notes and can be found in [13]. 


4 After Rudolf Lipschitz, German mathematician of the XIX century. 


8.2 


10. 
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Now, do the following items: 


(a) Ifm ¢ Im(f) and g: [a,b] > R is the function given by g(x) = Foon for 
every x € [a,b], show that there exist c > 0 such that g(x) < c, for every 
x € [a, b]. 

(b) Yet under the hypothesis of item (a), conclude that f(x) > m+ + for every 
x € [a, b] and arrive at a contradiction. 

(c) Argue in an analogous way to show that M € Im/(f). 


. Given real numbers a < b, give an example of a continuous function f : 


(a, b) + R, which is unbounded from above and from below. 


. Iff : (a,b) > Ris a continuous function and P(x, yo) is a point not belonging 


to the graph of f, we define the distance from P to Gy, denoted d(P; Gy), by 
d(P; Gr) = inf{A’P; A’ € Gy}. 

If |yo —f(%0)| < min{|xo — a], |x0 — b]}, prove that there exists A € Gy such that 

d(P; Gr) = AP. 


* Let f : R > R be the polynomial function given by 


F(0) = nx" + An"! + +++ + ax + ag 


for x € R, where ao, a),...,4, € R anda, # 0. If n is even and a, > 0, prove 
that: 
(a) faa = i= ldn—1] ah if x F 0. 


x" 


(c) There exists A > 0 such that f(x) > ao for |x| > A. 
(d) There exists x) € [—A, A] such that f(xo) = min{f(x); x € R}. 


x| 
(b) f= > ga, — Wt aon |a;| if |x| > 1. 


. * The purpose of this problem is to prove the famous Banach fixed point 


theorem? in the real line. To this end, let 0 < c < landf : R > Rbea 
function such that 


If@) —fO)| Ss clx—yl], Vay eR. (8.8) 


Do the following items: 


(a) Choose an arbitrary xo € R and let (X%,)n>1 be such that x, = f(xx-1), for 
every k € N. Prove that |x,4.1) — xx| < clxx — xg-1| for every k € N. 

(b) Use the result of Example 7.28 to conclude that (x,,),>1 is convergent. 

(c) If @ = lim,++.0%,, use the continuity of f (which is guaranteed by 
Problem 3) to show that a is the unique fixed point of f. 


> After Stefan Banach, Polish mathematician of the XX century. 


264 8 Continuous Functions 


12. Give an example of a continuous function f : R — R, without fixed points and 
such that |f(x) — f(y)| < |x — |, forall x,y € R. 

13. * Let f : [0,1] — R be a continuous function such that f(r) > 0 for every 
dyadic rational (cf. Problem 5, page 206) r € [0, 1]. Prove that f(x) > 0, for 
every x € [0, 1]. 

14. (Berkeley) Let a be a given irrational number. Find all continuous functions 
f :R— R such that 


f@~) =fe+)=fa+a), 


for every x € R. 


8.3. The Intermediate Value Theorem 


In this section, we show that a continuous function defined on an interval does 
satisfy the intermediate value property. Then, we present several interesting appli- 
cations of this fact. We start by analysing the following special case, known as 
Bolzano’s theorem. 


Theorem 8.27 (Bolzano) Let f : [a,b] — R be a continuous function. If 
f(a)f(b) < 0, then there exists c € (a, b) such that f(c) = 0. 


Proof Suppose, without loss of generality, that f(a) < 0 < f(b), and let 
A = {x € [a,b]; f is negative on [a, x]}. 


Since a € A by hypothesis, we have A # @. On the other hand, A is bounded (for, 
A C [a, b]), so that we can take c = sup A. We shall show that f(c) = 0. 

We initially assert that c > a. Indeed, since f(a) < 0, Lemma 8.20 guarantees 
the existence of 0 < 6 < b—a such that f(x) < 0 for x € [a,a + 5); therefore, 
c>at+6. 

Now, suppose f(c) < 0. Then, c < b (for, f(b) > 0) and, once more from the 
sign-preserving lemma, there would exist 0 < 6 < b—c such that f is negative in 
(c —6,c + 6). However, since c = supA, we could take d € (c — 5,c) NA, so that 
f < 0 in [a, d]. Hence, we would have f < 0 in [a, d] U (c —6,c + 6] = [a,c + 8), 
which is in contradiction to the fact that c = supA. 

Finally, suppose f(c) > 0. Then (by invoking the sign-preserving lemma yet 
another time), there would exist 6 > 0 such that f is positive in (c—6,c +6) [a, b]; 
in particular, AM (c — 6, c] = 9, and we would have supA < c — 6, which is also a 
contradiction. 

Therefore, the only left possibility is that f(c) = 0. oO 


Example 8.28 If f : R > Ris a polynomial function of the form 
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F(X) = nx" + apix" | +++ + ax t ao, 


with ao, a1,...,dn € R, an 4 0 and n odd, then the image of f is R. In particular, f 
has at least one real root. 


Proof Givend € R, let g(x) = f(x) —d. Then, g : R > R is a polynomial function 
satisfying the same hypotheses as f and, for c € R, we have f(c) = d if and only 
if g(c) = 0. This reasoning reduces the example to proving the existence of c € R 
such that f(c) = 0. To this end, we shall use Bolzano’s theorem. 

Without loss of generality, let a, > 0. For x 4 0, several applications of the 
triangle inequality give 


X aAn—-1 a{ a 
OD og DOD hs — ai 
x x x” Die 
aAn— a a 
> an — nal oe 1 isl) 
x xn-l xn 
an-1 a\ a0 
2 [tet—— 
oC a lai] lao] 
"al ae 


y i 3 ols 1 n—-1 
in turn, this last expression is positive for |x| > Pa Yj=0 lail- 


f 


The argument at the previous paragraph shows that acs) > Oatx = +A, whenever 


F(A), and Bolzano’s theorem assures the existence of c € [—A, A] such that f(c) = 0. 
oO 


A > max {I + Be ail}. However, since n is odd, it follows that f(—A) < 0 < 


Our next result is a refinement of Bolzano’s theorem, which shows that every 
continuous function defined on an interval satisfies the intermediate value property. 
For this reason, this result is known as the intermediate value theorem (we 
abbreviate IVT). 


Theorem 8.29 (IVT) Letf, g : [a,b] > R be continuous functions. If f(a) < g(a) 
and f(b) > g(b) (or vice-versa), then there exists c € (a,b) such that f(c) = g(c). 
In particular, if a real number d belongs to the interval with endpoints f (a) and f (b), 
then there exists c € [a,b] such that f(c) = d. 
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Proof For the first claim, note that h = f—g is continuous and such that h(a)h(b) = 
(f(a) — g(a)) (f(b) — g(b)) < 0. Hence, Bolzano’s theorem gives c € (a, b) for which 
h(c) = 0, i.e., such that f(c) = g(c). 

The particular case of the second claim is obtained by taking g to be the constant 
function, equal to d. Oo 


As a first application of the IVT, let us show that the image of a continuous 
function defined on an interval is also an interval. 


Corollary 8.30 [f7 C Ris an interval andf : I > R is a continuous function, then 
the image of f is also an interval. If I = [a, b], then there exist real numbers c < d 
such that Im(f) = [c, d]. 


Proof Let’s first consider the case J = [a, b]. By Weierstrass Theorem 8.26, there 
exist Xm,xm € [a, b] such that f attains its minimum and maximum values at x, and 
Xu, respectively. Letting f(%m) = c and f(xm) = d, we have Im(f) C [c, d]. On the 
other hand, for a fixed y € [c, d], the IVT gives a real x, belonging to the interval of 
endpoints x,, and xy and such that f(x) = y. In particular, Im(f) D [c, d]. 

For the general case, note that every interval J can be written as J = See [an, Dn], 
with [a;,b,] C [a2, bo] C ... (for instance, if 7 = (a, b), we could take a, = a+ 4 
and b, = b— 1, provided n > Eo —the remaining cases can be dealt with in 
analogous ways). Now, letting [c,,d,] be the image of the interval [a,, b,] by f, 
it’s easy to show that [c),d\] C [c2,d2] C ... and, hence, that LU), ,[cn. dn] is an 
interval J. On the other hand, it’s also easy to see that, since J = Cee [an, Dn], we 
have 


Im(f) = |_Jlen. du] = J. 


n>1 


In what follows, we discuss some other interesting applications of the IVT. 


Example 8.31 Let f : [0,1] — [0,1] be a continuous function. Prove that there 
exists a real number 0 < c < 1 such that f(c) = c (i.e., prove that f has at least one 
fixed point). 


Proof If f(0) = 0 orf(1) = 1, there’s nothing left to do; otherwise, suppose f(0) > 
0 and f(1) < 1. Letting g : [0, 1] — R be the function given by g(x) = x, we then 
have f(a) > g(a) and f(b) < g(b). Hence, the IVT gives 0 < c < | satisfying 
f(c) = g(c) or, which is the same, f(c) = c. Oo 


Example 8.32 (Bulgaria) Let m,n = 1 be given integers. For x > 0, compute the 
number of real solutions of the equation 


i, 1 
ba tat SS Sa ast Ve tee ee. 
Xx Xx XxX Xx 


Solution We claim that the given equation has just one real solution. To this end, 
note first that, for x > 0 and a natural k, the function x J is decreasing, whereas 


x 
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the function x +> 4/x is increasing. Now, since a finite sum of increasing functions 
on the same domain is increasing and a finite sum of decreasing functions on the 
same domain is decreasing (prove these facts!), we conclude that the functions f, g : 
(0, +00) — R given by 
1 1 1 1 : P 
fa)=rotatate ts and a) =x+ SX + LX + Ax 

are decreasing and increasing, respectively. On the other hand, since the solutions 
of the given equation correspond to the positive values of x for which f(x) = g(x), 
Problem |, page 183, guarantees that such equation has at most one real solution. 

In order to show that a real solution indeed exists, let’s use the IVT, initially 
observing that both f and g are continuous functions. We then consider two cases 
separately: 


¢ forx > 1, we have 


1 1 1 1 
peers p< - and c>4/x>--> "Yx> Wz, 
Xx Xx 


xm xn 
so that f(x) < 4 and g(x) > nix. In particular, f(x) < g(x) if 4 < nix, ie., if 


x> (2), Hence, f(x) < g(x) for x > max {1, (2yar. 
¢ for 0 <x < 1, we have 


1 1 
SSS and x< Ja <r < "Va < Vx, 
X x 


so that f(x) > 4 and g(x) < nix. In particular, f(x) > g(x) if 4 > nvvx, ie., if 


x< (2), Hence, f(x) > g(x) for0 <x < min {1, (2) ; 


Finally, the previous discussion assures that we can take real numbers 0 < a < 
1 < b satisfying f(a) > g(a) and f(b) < g(b), and the IVT provides c € (a, b) such 
that f(c) = g(c). Oo 


Example 8.33 (Romania) There exists a continuous function f : R — R such that 


MEQ S fat 1) ¢Q 


Solution Suppose such an f exists, and let g : R — R be given by g(x) = 
f(x«+ 1) —f(@). Then, g is continuous (by the chain rule for continuous functions) 
and, by the stated conditions, transforms every real number into an irrational 
number. However, since every nondegenerate interval contains irrational numbers 
(see Problem 4, page 206), the only way of not contradicting the IVT is that g is 
constant. Thus, there exists an irrational number @ such that f(x + 1) —f(x) = o@ for 
every x € R. Therefore, 


f@t+2)—-f(@%) =f«+2)-fa@t+ I+fa+ Il) —-f@) = 2a, (8.9) 


also for every x € R. 
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We now assert that there exists x7 € R such that f(x) € Q. Indeed, take any 
real number a; if f(a) is not rational, it follows from the hypotheses that f(a + 1) is 
rational, and it suffices to take x» = aorxy) =a+t+ 1. 

Finally, for xp as in the former paragraph, the hypotheses on f guarantee that 


f (xo) € Q=> fo + 1) € Q => fo + 2) €Q. 


Hence, f(xo + 2) —f (xo) € Q, which contradicts (8.9) and finishes the proof. oO 


For the coming example, recall that a functionf : R > Ris evenif f(—x) = f(x), 
for every real x. 


Example 8.34 Let f : R > R bea continuous function such that f (f(x)) = x7 + 1, 
for every x € R. Prove that f is even. 


Proof First of all, note that for every x € R we have 
fF@Q) =r +1 > fFEF@O)) =f? +1 
= fer +1=fe +) 
= fey +1=fCxy’ +1 
=> f(x) = +f(-*) 
Now, if f(a) = f(B) = 0, then 
a +1 =f(F(a)) =f) =fF(6)) = 6 +1, 


so that aw = +f andf has at most two zeros. Now, there are three possibilities: 


¢ f(x) 0, for every real x: by the IVT, f has a constant sign in R. However, since 
f(x) = +f(—+) for every real x, it follows that f(x) = f(—x) for every real x, and 
f is even. 

e f has a single zero, say at x = a: since f(—~) = +f(a) = 0, we must have 
a@ = —a, so that a = 0. On the other hand, 


fFO) =O +1=1=>fO) 40 
which is a contradiction. 
e f has exactly two zeros, atx = a and x = —a, for some a > 0: again by the IVT, 
f has a constant sign in (—a@, a). Let g(x) = f(x) — x. If f > 0 in (—a@, a), then 
g(0) =f(0)-—O>0 and g(a)=f(a)-—a =-a <0, 


so that there exists 0 < c < a for which g(c) = 0. If f < 0 in (—a, @), then 


(0) =f(0)-0 <0 and g(—a) =f(-—a)—(-a) =a >0, 
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so that there exists —a < c < 0 for which g(c) = 0. In any case, f admits a fixed 
point c. However, 


fO=c>c=fFO)=C+15¢-c+1=0, 


which is impossible. Oo 


We close this section by using the IVT to prove the continuity of the inverse of a 
continuous function defined on an interval. 


Theorem 8.35 Let I C R be an interval and f : I — R be a continuous 
Junction. Then, f is injective if and only if f is increasing or decreasing. Moreover, 
in this case: 


(a) The image J of f is an interval of the same type (i.e., open, half-open or closed) 
as I. 
(b) f—! : J > Lis continuous. 


Proof \ff is not injective, then f is neither increasing nor decreasing. Conversely, 
if f is neither increasing nor decreasing, then there exist a < b < cin J such that 
f(a) < f(b) = f© or f(a) = f(b) < fo). Suppose that f(a) < f(b) = f(c) (the 


other case is entirely analogous) and choose d € R such that 


max{f(a),f(c)} < d < f(b). 


The IVT assures the existence of x) € (a,b) and x; € (b,c) (therefore, xo, x; € J) 
such that f(x9) = d and f(x;) = d. In particular, f(xo) = f(x1) and f is not 
injective. 


(a) Corollary 8.30 showed that J is an interval. Since f is injective, we can suppose 
that f is increasing (again, the case of a decreasing f is analogous). Let J = 
(a, b), with a,b € R (the remaining cases can also be dealt with in analogous 
ways). 
If Im(f) = (c,d] or (—oo,d], with c,d € R, take x9 € (a,b) such that 
f(%o) = d. Since f is increasing, for x € (x0, b) we have f(x) > f(xo) = d, 
thus contradicting the fact that f(x9) € Im(f). Analogously, we show that 
Im(f) ¥ [c, d),[c, +00), [c,d], with c,d € R. Therefore, Im(f) is also an open 
interval. 


(b) Let’s look at the case of an increasing f, with 7 = (a,b) and Im(f) = (c,d) 
(once more, the analysis of the other ones is quite similar). We start by observing 
(according to Problem 8, page 176), that f—! is also increasing. Now, for a fixed 
yo € (c,d), let x» = f' (yo). Given € > 0, we want 5 > 0 such that 


yé(c.d), ly—yol <8 > fF 'O)-f 00) <e. (8.10) 
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To this end, let 69 = min{yo — c,d — yo} and take 0 < 6 < do (so that the 
condition |y — yo| < 6 suffices to guarantee that y € (c, d)). Letting x = f-!(y), 
we have y = f(x) and can rewrite (8.10) in the following way: we want 0 < 6 < 
59 such that 


If (x) — f (x0)| < 6 > |x — x0] <€. 


Notice that we can pick € > 0 so small that x9 t€ € (a, b) (otherwise, change the 
originally given e > 0 by asmaller one, so that this additional condition is satisfied). 
Recalling that f is increasing, take 


0 <6 < min{do,f(%o + €) —fo), fo) — fo — €)}- 
Then, 


f(x) —f@0) < 6 = fF) —fo) < fo + €) —f (x0) 
=> f(x) <fo +6¢) > x <x +e 


and, analogously, 
f(x) —f(%0) > —6 > x > x9 —€. 
In any case, the choice of 5 gives 


f(x) — fxo)| < 5 = —8 < f(x) —f (x0) < 6 
=> -€<xX-X<€E 


=> |x—x0| <€, 


as we wished to show. oO 


As a first application of the previous result, we shall now give an alternative proof 
for Theorem 7.9 and Problem 8, page 254. 


Example 8.36 Given n € N, the n-th power function is f : R — R given by 
f(x) = x". If n is even, write also f to denote its restriction to [0 + oo). Since 
f is continuous, increasing and f(k) = k" > k for every k € N, the IVT gives 
Im(f) = [0, +00). Hence, for every y > 0 there exists a single x > 0 such that 
x" = y, and we denote by x = 4/y. The previous theorem now guarantees the 


continuity of the function 


f7" : [0, +00) —> [0, +00) 
y r>  a/y 
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Analogously, if n is odd, then f is continuous, increasing, Im(f) = R and (letting 
/y be as above) 


f'i:R—R 
yrh> wy 


is continuous. In both cases, f—! is called the n-th root function. 


The next example establishes the continuity of the inverse trigonometric func- 
tions, introduced in Problem 1, page 198. 


Example 8.37 The restriction of the sine function to the interval [-4, +], which 
we shall also denote by sin : [-5, 4] — [1,1], is a continuous and increasing 
bijection. The arc-sine function is its inverse aresin : [—1, 1] > [—4, 4], so that, for 


x< [=1,1] and y¢ [=4, $l, 
arcsinx = y & siny =x. 


Thanks to Theorem 8.35, the arc-sine function os increasing and continuous. 

Similarly, the inverse of cos : [0, x] — [—1, 1] is the arc-cosine function arccos : 
[-1,1] — [0,2], which is continuous (again by Theorem 8.35) and decreasing. 
Note also that, for x € [—1, 1] and y € [0, z], we have 


arccosx = y ] COSy = x. 


Finally, the arc-tangent function arctan : R > (—3, z) is the inverse of the 
IU IU 


restriction tan : (-3, z) — R of the tangent function, so that it is also continuous 


22 
), we have 


and increasing. Also, for x € R and y € (—4, 4 


a) 


arctanx = y > tany = x. 


Problems: Section 8.3 


1. The continuous function f : (—oo, 1) U (1, +00) > R, given by 


x, if x € (—oo, 1) 


aa eo ere 


satisfies f(0) < ; < f(2) but there doesn’t exist any c in the domain of f such 


that f(c) = 3. Why does this example do not contradict the IVT? 
2. Compute the number of real solutions of each of the following equations: 


(a) |x| +1 =x. 
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10. 


12. 


13: 


8 Continuous Functions 


(b) cosx = x’. 
(c) sinx = 3. 


. Let f : R > R be given by f(x) = x° sin x, for every x € R. Show that its graph 


intersects every non vertical real line an infinite number of times. 


. Let n > | be an integer. Write down an explicit expression for a continuous 


function f : [0, 1] — [0, 1] with exactly n fixed points. 


. * Let J be an interval and X be a nonempty subset of J having the following 


properties: 


(i) For every xo € X, there exists 6 > 0 such that 7M (xo — 6, x0 + 5) C X. 
(ii) If (adn)n>1 is a sequence of points of X and / € J is such that a, — J, then 
le xX. 


Show that X = J. 


. (Bulgaria) Let n > 1 be an integer and aj, ao, ..., dn be given positive real 


numbers. Prove that the equation 


Vltayxt+ Vl +axt---+ Jl +a,x=nx 


has exactly one positive real solution. 


. Let x1, x2, ..., X%, be real numbers in the interval [0, 1]. Prove that there exists 


x € [0, 1] such that 


n 
|x —2x1| + |x—22| +--+ + |x-x,| = 5 


. (Leningrad) Let f : R — R be a continuous function satisfying, for every 


x € R, the relation f(x)f(x + 2) + f@ + 1) = 0. If f(0) = 0, show that there 
exist infinitely many values of x for which f(x) = 0. 


. (Leningrad) Let f : R — R be a continuous function such that f(x)f(f(x)) = 1, 


for every real x. If f(1000) = 999, compute f (500). 

(Australia) Find all real values of a for which, for every continuous function 
f : [0,1] > R satisfying the condition f(0) = f(1), there exists xo € [0,1 — a] 
such that f(xo) = f(x%o + a). 


. Let f : R > R be such that f(@ + 1)f(f(@) + 1) + 1 = 0, for every real x. Prove 


that f is not continuous. 
Let f : [0,1] — [0,1] be a continuous function, such that (f o f)(x) = x for 
every x € [0, 1]. 


(a) Show that f is an increasing or decreasing bijection. 
(b) Iff is increasing, show that f(x) = x for every x € [0, 1]. 
(c) Show that there are infinitely many possibilities for f, if it is decreasing. 


(Leningrad) The continuous functions f, g : [0,1] — [0,1] are such that f o 
g = gof.Iff is nondecreasing, prove that there exists 0 < a < 1 such that 


f(@) = sa) =4 


8.3 


14. 


16. 


17. 
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(Crux) Let f : R — R be a continuous function which assumes positive 
and negative values. Given a natural number k > 2, prove that there exists a 
nonconstant AP (a, a2,..., a) such that 


f(ai) +f(a2) + +++ +f (ax) = 0. 


. (TT) Prove that, for each natural number n, the graph of any continuous 


increasing function f : [0,1] — [0,1] can be covered by n closed rectangles, 
each of which having area + and sides parallel to the coordinate axis. 
(Leningrad) Let f : R — R be a continuous function such that, for every real x, 
we have f(x + f(x)) = f(x). Prove that f is constant. 

(Belarus) Find all functions f, g,h : R — R such that, for every x,y € R, we 
have 


faety) +e? +y) =hQy). 


Chapter 9 
Limits and Derivatives 


In this chapter we study derivatives of functions and some of their applications. 
Along the way, we show how to use derivatives to solve the problem of finding 
the monotonicity (resp. concavity) intervals of a differentiable (resp. twice differen- 
tiable) function, as well as to build an accurate sketch of the graph of such functions. 
In turn, the analysis of such problems will motivate several interesting applications 
of the concept of derivative to problems of maxima and minima. 


9.1 Some Heuristics I 


As a motivation for what is to come, in this section we heuristically discuss two 
problems which gave birth to the notion of derivative of a function. 

Let’s start by trying to define the tangent line to the graph of a function f : 
(a,b) > Rata point A(xo, f(xo)) on it, 0 so that xo € (a, b). Taking x; € (a, b) \ {xo} 


and letting B; (x;,f(%1)), we say that AB, isa a secant to to the graph of f and passing 


through A. In Fig. 9.1, we consider secants AB, and AB to the graph of f. 


A little geometric intuition makes it plausible to guess that a generic secant line 
—_ 


AB should come closer and closer to the tangent line to the graph of f at A, as long 
as B approaches A along the graph (or, which is the same, as long as x approaches 
Xo in (a, b)). 

If x; € (a,b) \ {xo} and B(x, f(x1)), elementary analytic geometry (cf. Chap. 6 


f [4], for instance) shows that AB has equation 


FQ) ate m=) (9.1) 


x1 — Xo 


y—f (xo) = ( 
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Fig. 9.1 Secants AB, and 
AB, to the graph of f 


on the other hand, if we can define the line tangent to the graph of f at A, and it is 
not vertical, then its equation must be of the form 


y—f (Xo) = m(x — x0), (9.2) 


for some me R. 
Comparing (9.1) and (9.2), and taking into account the discussion at the 
previous paragraph, we are led to the conclusion that the quotients fey Co) should 
approximate m better and better, provided x; approximates x better and better. Thus, 
we conclude that the line tangent to the graph of f at A is that of Eq. (9.2), where m 
is the limit value of the quotients aan when x; comes closer and closer to xo. 
fav Sf (xo) 


X1—Xx0 
of the function f at x9, which we denote f’(xo) or, in classical notation, £ (x9). 
Writing x in place of x;, we summarize the previous discussion by “defining” 


This limit value of the quotients when x; approaches xo is the derivative 


df, . a f(x) —f (0) 
—(x)) = lim ————— ., 
dx xx X— XO 
where the notation lim means that the closer x is to xo, the closer Aes 1S 


xX>X0 


to g (xo). Hence, for small values of |x — xo|, we expect to have 


Ff (gy wf) =Feo) 


dx x— Xo 


Classically, the differences f(x) — f(xo) and x — xo were respectively denoted by 
Af ie and Ax i and were called the variations of f and of the independent variable 
in the interval [xo, x]. Accordingly, a = is the rate of change of f (with respect to 
the independent variable) in the interval [xo, x]. Therefore, for small |x—xo9|, we have 

df Af \* 


ie = Ke 


x0 
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Now, suppose we are in a situation where x represents time and f represents some 
quantity that evolves with time. Writing fin place of x and fo in place of xo, it follows 
from the above that 

t 
Fig) = him 
La At lio’ 
so that £ (to) can naturally be called the instantaneous rate of change of f with 
respect to ¢, at instant fo. 

As a particular situation of that of the previous paragraph, if f(t) measures 
the displacement of a point object along a real line at instant ¢, then £ (10) is the 
instantaneous velocity of the point object at instant fo, and is the starting point of the 
discussion of Kinematics at most Physics books. 

The circle of ideas we developed up to this point, and much more, bears the 
name of Differential Calculus and is considered to be the creation of two of the 
most brilliant minds mankind ever produced, Newton and Leibniz. ! 

The rest of this chapter is devoted to a thorough development of the concepts, 
results and applications of the Differential Calculus. 


9.2 Limits of Functions 


Let be given a continuous function f : (a,b) > R and xp € (a,b). Recalling our 
intuitive discussion so far, we conclude that if there exists a reasonable notion of 
tangent line to the graph of f at the point A(xo,f(%0)), and if such a line is not 
vertical, then its equation must have the form (9.2), where 
i= Oe (9.3) 
X—>x0 x— Xo 

Here, the expression at the right hand side above intuitively represents the limit value 
of the quotients FQ) fo) © = G0) as x goes to (i.e., approaches) xo, provided such a “limit” 
exists in a proper sense (to be precised in a while). 

Since our discussion rests upon rather naive grounds, we start to fix this now by 
setting some preliminaries and giving a rigorous definition of the notion of limit of 
a function at a point. 


Definition 9.1 Given a € R, a neighborhood of a is an interval of the form (a — 
r,a +r), where r is a positive real number. In this case, we say that r is the radius 
of the neighborhood (a — r,a + r) and that each x € (a—r,a +r) approximates a 
with error less than r. 


‘Gottfried Wilhem Leibniz, German mathematician and philosopher of the XVII century. Together 
with Sir Isaac Newton, Leibniz is considered to be one of the creators of the Differential and 
Integral Calculus. Some of the notations used in Calculus up to this day go back to Leibniz, having 
survived the ruthless test of time. 
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Let J C R be an interval, let x9 € J and f : J \ {xo} > R be a given function. For 
a given real number L, our first task is to formulate a precise definition for the claim 
that f(x) can be taken as close to L as we wish, provided we take x € / sufficiently 
close to (but different from) xo. 

A little reflection allows us to conclude that a reasonable formulation of this 
concept is obtained if we ask that, to each given neighborhood of L, there exists a 
neighborhood of xo that is applied to the former one by f. 

Since a neighborhood of x9 is completely determined by its radius r, and 


x E (xX) —1,X0 +1) \ {x0} & 0 < |x— x0] <7, 


we can summarize the previous discussion in the following definition (see Fig. 9.2). 


Definition 9.2 Let J C R be an interval, let x9 € J and f : I \ {xo} > R be a given 
function. We say that f has limit L when x goes to xo, and write 


lim fwm=L, (9.4) 


if, to each given real number € > 0, there corresponds a real number 6 > 0 such that 
xeEl, 0< |x-—x0| < 6 => |f@) -L| <e. (9.5) 


In words, (9.5) happens if, to each arbitrarily given error « > O for L, there 
corresponds an error 6 > 0 for xo such that, if x € J \ {xo} approximates xo in J with 
error less that 5, then f(x) approximates L with error less than e. 

Geometrically, we want that, to each x € I sufficiently close to (but different 
from) xo, the point of the graph of f with abscissa x belongs to the gray strip 
of Fig. 9.2. 


Fig. 9.2. Limit of a function at a point 
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At this point (and as it happened in our study of continuity), it is worth stressing 
that, to justify the validity of a specific limit by applying the definition given above 
is the same as to play the following cat and mouse game: to each arbitrarily given 
error € > 0 for the limit candidate L, we ought to be able to find an error 5 > 0 for 
Xo (which, in general, will depend both on € as on xo itself) so that the validity of 
the condition 0 < |x — xo| < 6 for some x € J implies |f(x) — L| < €. 

Having gone through the study of continuity, the reader has surely acquired some 
familiarity with the dynamics of finding which 6 > 0 are adequate to some given 
€ > 0. In spite of this, and as a remembrance, we shall carefully work out two 
examples below. In all that follows, and whenever there is no danger of confusion, 
we shall frequently omit explicit references to the domain or codomain of the 
functions f involved, thus focusing on the expressions that define f(x) in terms of 
the independent variable x. Whenever this is so, we shall implicitly assume that the 
domain of f is the maximal one (cf. Sect. 6.1), whereas its codomain is the set R of 
real numbers. 


Examples 9.3 


(a) lim,—.2(—2x + 7) = 3: let € > O be given. Departing from x € R such that 
0 < |x—2| <6, we have 


|\(—2x + 7) — 3| = | —2x+4+ 4| = 2|x- 2] < 26. 
Therefore, choosing 5 > 0 so that 25 < €, we conclude that 
xER, 0< |x—2| <6 => |(—2x+ 7) —3] < 26 <e, 


as wished. 
(b) lim,,3.x? = 9: again, let € > O be given. Starting with x € R such that 0 < 
|x — 3| < 6, we have 


|x? — 9| = |x —3||x + 3] < blx-3 46 
< 8(|x— 3| + 6) < 66 + 6), 


where we used the triangle inequality in the next to last passage above. Hence, 
if it is possible to choose 6 > 0 so that 6(6 + 6) < €, we will have 


xéER, 0<|x—-3| <6 > |x’ —9| < 6(6 4 6) <e, 
as wished. It now suffices to show that it is possible to choose such a 6 > 0, 
and to this end we just have to solve the inequality 5(6 + 6) < €. By doing so, 
we get 


0<d< Ve+9-3. 


Looking back at the previous examples, we conclude that if we have a function 
f : T\ {xo} — R and want to prove that lim,_,,, f(x) = L, then, as in Sect. 8.1, the 
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key to find out which 5 > 0 works for a given € > 0 is to reason in the following 
way: starting with an x € J such that 0 < |x — xo| < 6, we estimate from above the 
error |f(x)—L| in terms of 6, thus getting an inequality of the form |f(x)—L| < E(6). 
Here, E represents a certain function of 6; in item (a) we got E(5) = 26, whereas 
in item (b) we found E(6) = 6(6 + 6). Then, we impose that such an error E(5) 
does not surpass ¢, thus finding the appropriate values of 6 (usually, this second step 
reduces to solving, for 5 > 0, the inequality E(6) < e). Finally, if 6 > 0 satisfies 
E(6) < ¢, then we clearly have that 


x€T, 0< |x—x0| <6 > |f@) -L| < ES) <€, 


as wished. 

Notice that the only difference between the discussion above and that of Sect. 8.1 
lies in the fact that, here, we have to start by assuming |x — xo| > 0, i.e., x # xo. On 
the other hand, and as the coming proposition explains, such a difference was not 
relevant to the previous examples because both functions x > —2x + 7 and x b> x? 
are continuous. 


Proposition 9.4 Let J C R be an interval and f : I > R be a given function. For 
xo € I, we have f continuous at xo if and only if 


pare (x) = f (0). (9.6) 


Proof First of all, let f be continuous at xo. Definition 8.2 guarantees that, given 
€ > O, there exists 6 > O such that 


xe, |x—xo| <6 => |f@) —f(0)| <e. 


In particular, if x € J and 0 < |x — x9| < 6, we still have |f(x) — f(x0)| < €, so 
that (9.5) is satisfied. Therefore, lim,_,,, f(x) = f (xo). 

Conversely, let lim,—,,. f(x) = f(xo). According to the definition of limit, given 
€ > 0, there exists 5 > 0 such that 


xel, 0< |x-—x9| < 6 > [f(x) —f%)| <e. 


However, since the condition |f(x) — f(x9)| < € is trivially satisfied for x = xo, we 
can certainly write the implication above as 


xe, |x—xo| <6 => |f(x) —f@o)| <e. 


Hence, f is continuous at xo. oO 


Example 9.5 Givenf : R\ {1} > R such that f(x) = “=3°+? for every x # +1, 
we want to compute lim,_,; f(x). To this end, we initially observe that 1 is a root 
of both the numerator and the denominator of the defining expression for f(x), with 
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x — 3x? +2 = (x— 1)(X? — 2x — 2) and x? — 1 = (x— 1)(x + 1). Therefore, on 
R \ {£1} we have 


_ (x — 1)(x* — 2x — 2) _ x —2x-2 


fe) @—-D@+H «41 


x2—2x—2 


TF 1s continuous, 


Now, since the function g : R \ {-1} — R given by g(x) = 
the previous proposition gives 


3 
lim f(@) = lim g(x) = (1) = —5. 


A rather important fact on limits of functions is that if lim,—,, f(x) = L, with 
L # 0, then there exists a neighborhood of xo such that f has the same sign of L 
in all points of its domain which belong to this neighborhood (with the possible 
exception of xo itself). This is essentially the content of the coming result, which 
extends Lemma 8.20 to limits of functions and for this reason is also known as the 
sign-preserving lemma. 


Lemma 9.6 Let I C R be an interval, x» € I and f : I \ {xo} > R be a given 
function. If lim, f(x) = L, with L # 0, then there exists 6 > 0 such that 


xé€l and 0< |x-x|<6> gh ae nee 
—3L/2 <f(x) <-L/2, ifL<0 

Proof Suppose L > 0 (the other case is completely analogous). By the definition of 
limit, given € = £ > 0 there exists 6 > 0 such that 


L 
xel, 0 <|x— xl <3 = [f@) —L <>. 


Hence, for each such x we have -§ <f@-—-L< £, which is the same as £ < 


f@< x. Oo 

We now collect some arithmetic properties on limits that pretty much simplify 
their computation. For item (c) of the coming proposition, it is worth observing 
the following: given an interval J C R, xo € J and g : I \ {xo} — R such that 
lim,., g(x) = M 4 0, Lemma 9.6 guarantees the existence of r > 0 such that g 
doesn’t vanish on J\ {xo}, where J = IN (xo—r, xo +r). Hence, when considering the 
function : such that *(x) = we will always implicitly assume that its domain 
is restricted to J \ {xo}. 


a 
g(x)’ 


Proposition 9.7 Let 1 C R be an interval, let x)» € I andf,g : I\ {xo} > R be 
given functions. If lim,x,. f(x) = L and lim,+x) g(x) = M, then: 
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(a) lim,..(f + g)®) = LM. 
(b) Himy x9 (fF 7 8) = L 7 M. 
(c) limy-+x, (£) (x) = 4, provided M # 0. 


Proof In all cases below, we let € > 0 be given. 


(a) Let’s do the proof for f + g, the proof for f — g being completely analogous. As 
was previously hinted, our strategy will be to try to estimate |(f+g)(x)—(L+M)| 
by excess, in terms of |f(x) — L| and |g(x) — M|. This is easily accomplished 
with the aid of the triangle inequality, which gives 


If + s)@)—-(L+M)| =|) —-L) + (g@) -M)| 
S If) —L| + |g@) — MI. 


Thus, in order that |(f + g)(x) —(L+ M)| < «€ for x € J close to (but different 
from) Xo, it suffices for us to have |f(x) — L| < 5 and |g(x) — M| < 5. 
Since 5 > 0 and lim, +,, f(x) = L, lim,+,, g(«) = M, the definition of limit 


guarantees the existence of 5,6. > 0 such that 


x Eel, 0< |x—x0| < 6) > |f@) -L| < 


NIE 


and 


x€T, 0 < |x—Xx0| < db. > |g) -—M| < 


Therefore, letting 5 = min{d,, 52} > 0 and taking x € J such that 0 < |x—x0| < 
5, we get 


f+ s)@)—-(@+M)| <f@)-L + |s@)-M 


goat 
~+.==€e. 
=< <2 


(b) Here again, we estimate |(fg)(x) — LM| by excess in terms of |f(x) — L| and 
|g(x) — M|. To this end, it follows from triangle inequality that 
|(fg) (x) — LM] = |f(x)(g@) — M) + (F@) — L)M| 
< FON lg@) — MI + If) — LI|M| 
< (ff) — L| + |L))|g@) — M] + [F@) — L||M| 
= |f(~) — L|lg@x) — M| + |Ellg@) — M] + MIF) — LI. 
Therefore, in order that |(fg)(x) —LM| < ¢ for x € I close to (but different from) 


Xo, it suffices for us to have |f(x) — L||g(x) — M], |L||g(x~) —M| and |M||f(x) — L| 
all less than $. In turn, these inequalities hold provided we have 
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iF) — Lhe le) — MI < Bs 


Ig(x) — M| < and [f(x) —L| < 


€ € 
3(|L| + 1) 3(|M| + 1) 


In short, it suffices to have 


' J€ € 

If (x) — LI < min | 3° Ea 
- € € 

je) <a] < min}, 


Now, letting €; = min | ai seit > O and €> = min \V5 aro > 
0, the definition of limit guarantees the existence of errors 5,62 > 0 such that 
If (x) — L| < €; for every x € J such that 0 < |x —x9| < 6), and |g(x) —M| < €2 
for every x € I such that 0 < |x —x9| < 62. Hence, setting 6 = min{d,, 52} > 0, 
we conclude that x € J and 0 < |x — xo| < 6 imply both |f() — L| < €; and 
|g(x) — M| < €2, as needed. 
The sign-preserving lemma gives 59 > 0 such that |g(x)| > val for allx € J 
satisfying 0 < |x — xo| < do. Sticking to the notations set forth in the paragraph 
that immediately precedes this proposition, we look at f as the function f : 
J \ {xo} > R, where J = IN (xo — 50, x9 + 40). 


and 


(c 


wm 


Since £ = tf: i, by item (b) it suffices to show that lim... + 2) = To this 


end, and taking ve account that |g(x)| > MM for x € J \ {xo}, we obtain ieee such 


an x) 


11] _ [e@)—M 
Fo M|~ leo) < gla) Mt 


way - 4 < €, we need to have |g(x)—M| < “S*. To fulfill 


the last condition above, we choose (by invoking once more the definition of limit) 
6, > 0 for which 


Therefore, in order that | 4, 
Me 
xeJ, 0< |x—xo| < 6) > |g(x) —M| < a 


Finally, letting 6 = min{do, 61} > 0 a fants x € J such that 0 < |x — xo| < 4, 
we have |g(x)| < fl and |g(x) — M| < ““£, as needed. Oo 
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An easy induction allows us to extend the formulas of items (a) and (b) of the 
previous proposition to a finite number of functions. More specifically, if J is an 
interval, xo € J and fi,...,fn : 1 \ {xo} — R are such that lim,-.,, f(a) = L; for 
1 <j <n, then 


lim(fjitfhp+---+fJ@)=hth+---+l (9.7) 
X>X0 
and 
dim (fifa -- fr) @) = Lia. Ln. (9.8) 


In particular, if k € N and f : I \ {xo} > R is such that lim,_,,, f(x) = L, then 
lim f(x)* = L*. 
X>XQ 


From now on, we assume the validity of these remarks without further comments. 
Our next result is usually referred to as the squeezing theorem. As we shall see 
right after its proof, it is quite useful for the actual computation of limits. 


Proposition 9.8 Let I be an interval, x) € I and f,g,h : I \ {xo} > R be such 
that g(x) belongs to the interval of ends f(x) and h(x) for every x € I \ {xo}. If 
lim, +x) f(x) = limy>x, A) = L, then lim, x) g(x) exists and is also equal to L. 


Proof Given € > 0, we want to find 6 > 0 for which the conditions x € J and 
0 < |x —xo| < 6 imply |g(x) — L| < «. To this end, if f(x) < g(x) < h(a), then 
f(x) —L < g(x) —L < h(x) — Land is easy to conclude that 


|g(x) — L] < max{|f(x) — L], |A@) — LI}. 


If h(x) < g(x) < f(x), one reaches the inequality above in essentially the same way. 
Now, the definition of limit gives 6,, 62 > 0 such that 


xeEl, 0< |x—xo9| < 6) > [FQ@) -L| <e€ 
and 
xEl, 0< |x—Xxo| < b2 > |h(x) -L| <e. 


Therefore, letting 6 = min{é,, 62} > 0 and taking x € J such that 0 < |x —x0| < 6, 
we get |f(x) — L| < € and |A(x) — L| < €, so that 
|g(x) — L] < max{|f(x) — L], |h@) — LI} <. 


oO 


Concerning the computation of limits, the coming corollary is quite a useful tool. 
For its statement, recall that a function f : J > R is bounded if there exists M > 0 
such that 
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If(x)| <M, Vxel. 


Corollary 9.9 Let I be an interval, x9 € I and f,g : I \ {xo} — R be such that f 
is bounded and lim,-,y, g(x) = 0. Then, lim,.., f(x)g(x) = 0, even if lim,» f(x) 
doesn’t exist. 


Proof If |f(x)| < M for every x € I \ {xo}, then 0 < [f(x)g(«)| < M|g(x)| for every 
x € 1 \ {xo}. Since lim,—,, |g()| = 0 (check this!), it follows from the squeezing 
principle that lim,_,,, [f(x)g(x)| = 0. Therefore, lim,_,,, f(~)g(x) = 0. Oo 
Example 9.10 If f : R \ {0} — R is given by f(x) = sin +, then |f(x)| < 1 for 
every x € R \ {0}. Since lim,.9x = 0, it follows from the previous proposition 


that lim,—+9 x - sin 1 = 0. Note that this result agrees with Problem 10, page 255, 
and (9.6). 


The squeezing theorem also allows us to compute the fundamental trigonomet- 
ric limit, which will reveal itself to be of crucial importance in the next section. 


sinx =] 


x 


Lemma 9.11 lim, 


Proof Since we wish to compute a limit, we can restrict ourselves to the interval 


|x| < 4. Suppose first that x > 0. Letting e(AB) = x be the length of the arc AB 


(cf. Fig. 9.3), we have 


sinx = BD < AB < &(AB) =x, 


so that sin < 1. On the other hand, it is well known (cf. Chap. 5 of [5], for instance) 


that the area of the circular sector AOB equals z - 3, = 3, whereas the area of the 


Fig. 9.3. The fundamental 
trigonometric limit C 


WN 
LY 
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triangle AOC equals 5 AC ; hence, we have 


x = £(AB) < AC =tanx, 
so that cosx < 92, 
Combining the two inequalities deduced above, we obtain 


sin 
cosx < —— <1 (9.9) 


for 0 < x < 4. However, since both x + cosx and x mss 


- are even functions, 
these inequalities still hold true for == < x < 0. Then, it follows from (9.9), 
together with the squeezing theorem and the continuity of the cosine function, that 


lim, “+ siMX exists and equals 1. oO 


Remark 9.12 The attentive reader may object the former proof under the following 
argument: the very definitions of the sine and cosine functions, as given in 
elementary school, rely upon the fact that a circle of radius 1 has length 27; in 
turn, as we shall see in Sect. 10.10, a thorough computation of length of a circle 
involves the use of the derivative of the sine function, so that we are in presence of 
a circular reasoning. Although this is indeed the case, we observe that we will fix 
this situation in Sect. 11.5, when we shall construct the sine and cosine functions 
without invoking any concepts or results of Euclidean Geometry. In particular, at 
that future time we will show, by purely analytical means, that lim,_,) “8 = 1 and 
sin’ x = cos x. For the time being, we suggest that the reader takes the proof of the 
previous lemma simply as a heuristic argument for justifying the validity of both 
these results. 


Let be given an open interval J, a point x9 € J and a function f : J \ {xo} > R. 
In a way entirely analogous to what we have done so far, we can define for f the 
concept of one-sided or lateral limits at x). More precisely: 


(i) If 1M (—co, x0) # G, we say that L is the left-handed limit of f as x goes to Xo, 
and write lim,_,,,_ f(x) = L, if, for each given € > 0, there exists 6 > 0 such 
that 


xET, X-8 <x <x > [Ff -L| <e. (9.10) 

(ii) If 1M (xo, +00) F Y, we say that L is the right-handed limit of f as x goes to 

Xo, and write lim,.,.4f(x) = L, if, for each given € > 0, there exists 6 > 0 
such that 


xE€T, xm <x<x +6 > [f(x -L| <e. (9.11) 


The idea behind the notation x — xo— (resp. x — x9+) should be clear to the 
reader: we write x — xo— (resp. x — xo+) in left-handed (resp. right-handed) 
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Fig. 9.4 The right-handed limit lim,_,,.4 f(x) 


limits, for x approaches xo from the left (resp. from the right), i.e., through values 
smaller (resp. larger) than xo itself. In yet another way, when x — xo— (resp. x > 
Xo+), we have x = x9 — (something positive) (resp. x = xo + (something positive)). 

Figure 9.4 gives a geometric interpretation for the notion of right-handed limit. 
Note that, for 0 < |x — xo| < 4, the point (x, f()) only belongs to the gray strip if 
Xo <x < x0 +6. We left to the reader the task of drawing a figure that provides the 
analogous geometric interpretation for left-handed limits. 

Yet with respect to one-sided limits, it is immediate to state and prove for them 
results analogous to those of Proposition 9.7 (cf. Problem 4). We also stress that, 
for a function f : (a,x9) — R, the notions of limite and left-handed limit of 
f at xo coincide (for, there is simply no way of approaching x9 from the right, 
while belonging to the domain of f). Thus, in such a case we shall generally write 
lim,-+x) f(x) = L instead of lim,-+,,—f(x) = L. Obviously, for a given function 
Ff : (x0, b) — R, analogous remarks are true for limits and right-handed limits at xo. 
In this respect, see also Problem 5. 

We close this section by defining the notions of infinite limits and limits at 
infinity, and sketching some elementary (though useful) results on them. 


Definition 9.13 Let be given an interval J, a point x9 € J and a function f : J \ 
{xo} > R. We write 


lim f(x) = +00 
x>X0 
if, for any given M > 0, there exists 6 > 0 such that 
xéET, 0< |x-x0| <6 > f(x) > M. 


We leave to the reader the task of stating definitions analogous to the one above 
for the concepts of lim,_,,, f(x) = —oo and lim,_,,,+ f(x) = oo. 
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The next result establishes two simple, yet useful, arithmetic rules for infinite 
limits. Since its proof resembles that of Proposition 9.7, we also leave it as an 
exercise for the reader. 


Proposition 9.14 Let be given an interval I, a point x9 € I and functions f, g : 
I\ {xo} > R. 


(a) If lim f(x) = L > Oand lim g(x) = +00, then lim f(x)g(x) = oo. 
x—>X0 xX>X0 x>X0 
(b) If lim f(x) = L < Oand lim g(x) = +0, then lim f(x)g(x) = oo. 
x9 X>X0 x—>x0 
The prototype of infinite limits are those collected in the coming example, whose 
proof is also an easy exercise for the reader. 


Example 9.15 


(a) Iff : (xo, +00) > R is given by f(x) = —L, then lim,_,,, f(x) = +00. 


x—x0’ 


(b) Iff : (—co, xo) > R is given by f(x) = | then lim,+x) f(x) = —oo. 


X—XQ? 


Let’s now turn our attention to limits at infinity. 


Definition 9.16 Given f : (a,+oo) > Rand LZ € R, we write lim,-++4o f(x) = L 
if, for a given € > 0, there exists A > a such that 


x>A=> |fQX)-L| <e. 


As before, we leave to the reader the task of stating definitions analogous to 
the above one for lim,-,-.o f(x) = L and limy-++o0 f(x) = oo. On the other 
hand, note that Propositions 9.7, 9.8 and 9.14 remain true if we change I \ {xo} by 
(a, +00) (resp. (—o0, b)) and xy by +c0 (resp. —0o); moreover, the proofs of the 
corresponding results in these cases are essentially the same. 

Let us see an interesting (and useful) example of application of Proposition 9.14. 


Example 9.17 Letn € N andf : R > R be the polynomial function given by 
F(R) = ay" + ay x”! +--- + ax + ao, 


with ao, a,,...,d, € Randa, > 0. Prove that: 


(a) If nis even, then lim, 450 f(x) = +00. 
(b) If nis odd, then limy-++00 f(x) = 00. 


Proof Since limjy-+490 x" = +00 if nis even, lim,++00 x” = Loo if nis odd and 
limp} +00 (dn + “St +--+ + sh + B) = a, (verify these claims!), it suffices to 
write 


FQ) = x" (ay + ee aes ed, 


and, then, apply Proposition 9.14 (with oo in place of xo). oO 


9.2 Limits of Functions 289 


One of the most important applications of infinite limits and limits at infinity is in 
identifying horizontal and vertical asymptotes of (the graph of) a function, according 
to the coming 


Definition 9.18 If / is an interval, x) € J and f : J \ {xo} — R is a function such 
that lim,.,, f(x) = -boo, then we say that line x = xp is a vertical asymptote of 
(the graph of) f. Analogously, if f : (a, -+oo) > R (resp. f : (—oo, b) > R) is such 
that lim,-+ +00 f(x) = L (resp. lim,-+—99 f(x) = L), then we say that line y = Lisa 
horizontal asymptote of (the graph of) f. 


Examples 9.19 


(a) Figure 6.17 sketches the graph of the tangent function in the interval (—4, 4). 
In view of the limits lim, z sinx = | and lim, cosx = 0, it follows from 
Proposition 9.14 that lim,.z tanx = +oo0. Analogously, lim,.—z tanx = 
—oo, so that lines x = +4 are vertical asymptotes of this graph. 

(b) Figure 9.5 sketches the graph of f : R \ {—1} — R, given by f(x) = =. 
This graph can be obtained from that of the inverse proportionality function (cf. 
Fig. 6.14) with the aid of Problem 10, page 194. Indeed, since 


2 x4+2-2 7) 
acai a9 (9.12) 


x+1 x+1 ~ os 


Fig. 9.5 Graph of x > =a 
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we can sketch the graph of f in the following way: firstly, we sketch the graph 
of x be +; then, we translate it one unit to the left, thus getting the graph of 


xt —; thirdly, we stretch _ second graph in the vertical direction by factor 
oH 


2, obtaining the graph of x > ——; we oe reflect the result along the horizontal 


aE 


axis, atriving at the graph of x + —— —; we end by ee this last graph 


a ? 
two units above, thus finally obtaining the graph of x H = =. 
Since lim,.-;4 2x = 2 and lim,+—)+ = = +o, it follows from 
Proposition 9.14 that lim,—1+ a= = oo, so that line x = —1 is indeed a 
vertical asymptote of the graph. On the other hand, it seals © follows from (9.12) 
that lim,++o5 = = 2, so that line y = 2 is a horizontal asymptote of the graph. 
Finally, note that these results are in perfect accordance with the geometric intuition 


given by the sketch of the graph. 


Problems: Section 9.2 


1. Let J be an interval and xo € J. Given a function f : J \ {xo} > R, prove that if 
lim,—+x) f(x) exists, then it is unique. 

2. * Let J be an interval, x) € J and f,g : I \ {xo} — R be such that there exist 
lim, f(x) = L and limy-+,, g(x) = M. If f(x) = g(x) for every x € I \ {xo}, 
prove thatL > M. 

3. * Establish the generalizations of items (a) and (b) of Proposition 9.7, as 
discussed right after the proof of it. 

4. * Extend Proposition 9.7 to one-sided limits. 

5. * Letf : (a,b) > R bea given function, and x9 € (a, b). Prove that lim, f(x) 
exists if and only if the one-sided limits lim,_,,,4 f(%) and lim,,.— f(x) exist 
and are equal. In this case, show also that lim,,,, f(x) equals this common 
value. 

6. Prove Proposition 9.14 and its analogue for limits at infinity. 

7. * Let —oo <a <b < +00 andf : (a,b) — R be a continuous and increasing 
(resp. decreasing) function. If lim,.,+ f(x) = L and lim,.,_ f(x) = M, with 
—oo < L,M < +00, prove that Im (f) = (LZ, M). 

8. Compute the following limits: 


(a) lim,+9 jaca (4) lim,>; uy 
b) lim oe 
(b) lim, = =e (e) lim, cos) ) 


(c) lim,+o sin(3x) 


(9 ime-no = 


9. If f,g : (a, +coo) — R are such that f is bounded and lim,-,+.0 g(x) = 0, 
prove that lim,_, +00 f(x)g(x) = 0. Then, use this fact to compute the two limits 
below: 


sinx 


(a) limy++00 ra 
(b) lim, +00 sin 


x+cosx"* 


9.2 


10. 


11. 


12. 


13. 
14. 


15. 


16. 
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* Let J be an interval, xo € J andf : J\ {xo} — R be such that lim,,, f(x) = L. 
If (an)n>1 is a sequence in J \ {xo} satisfying limp++0o0dn = Xo, prove that 
lim,++00f (an) = L. Then, use this fact to conclude that lim, sin + does not 
exist. 

* Given f : (A, +00) > R, we say that line y = ax+b is an oblique asymptote 
(of the graph) of f provided lim,_,+.0(f(x) — (ax + b)) = 0. If this is so, prove 
that 


and b= lim (f(x) —ax). 
xX—>+00 


Then, elaborate the concept of oblique asymptote y = ax+b forf : (—oo, A) > 
R and show how to compute a and b in such a case. 

In each of the following items, find out (cf. previous problem) the horizontal, 
vertical and oblique asymptotes of the given function: 


(a) f : R \ {0} > R such that f(x) = x + 1 for every real x # 0. 

(b) f : R\ (-a,a) > R such that f(x) = 2%? — a for |x| > a. (Notice— 
cf. Chap.6 of [4], for instance—that the graph of f is the portion of the 
hyperbola . - x = | situated in the upper semiplane of the cartesian 


plane.) 
(c) f : (0, +00) > R such that f(x) = x? sin + for x > 0. 


Compute lim, +00 ( x+ fxt Jx—- Vi). 


Let f : (0, +00) — (0, +00) be an increasing function. If limy.+ 


f(nx) 
F(x) 
Let (Gn)n>1 be the sequence defined by a; = 2/2 and, for an integer n > 1, 


An = 2" VY 2-V24+V24+00-+ af? with n square root signs. In this respect, 


do the following items: 


(a) Ifb, = Y2+V2+---+ V2 forn > 1, withn—1 square root signs, show 


that be = 2+ b,- and conclude that b, < 2 for everyn > 1. 

(b) Write b, = 2cos6,, with 0 < 6, < 4, to conclude that 6, = On —1 for 
every n > 2. 

(c) Show that a, = 2”t! sin oo = 2+! gin a Then, use this fact to show that 
limy++00 dn = 1. 


fo) 
re =} 


prove that lim,-++00 = 1 foreveryn €N. 


(IMO) Find all functions f : (0,-++co) — (0,+00) satisfying the following 
conditions: 


(a) fOf(y)) = yf) for all x,y > 0. 
(b) limy+ 400 f(x) = 0. 
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9.3. Basic Properties of Derivatives 


With the concept of limit at our disposal, we return to the discussion that lead 
to (9.3). 


Definition 9.20 Let 7 Cc R be an open interval and f : J > R be a given function. 
For a fixed xo € I, we say that f is differentiable at xo if the limit 


kim £2 feo) 
im ————— 


x XO xX — X09 


does exist. In this case, this limit is called the derivative of f at xo, and is 
denoted f’ (xo). 


In the settings of the previous definition, we observe that the existence of f’ (xo) 
is equivalent to the existence of the limit 
_ fo + h) — fo) 
lim. ——_—_—__. 
h>0 h 
Indeed, on the one hand, letting x» +h = x we have h = x — xo, and obviously 
h — 0 & x — x. On the other hand, when we write f(x + h) we are tacitly 
assuming that h is so small that xp +h € I; however, since we are computing a limit 
and J is open, such a supposition doesn’t impose any restriction upon the definition 
of derivative. 
In short, f being differentiable at x9 € J, we have 
_ h) — 
fein FQ) ~ Fo) _ 5 Oo +B) ~ FR) 


xX—>X0 Xx — X09 h>0 h 


(9.13) 


From now on, whenever convenient we refer to any one of the fractions above as the 
Newton’s quotient of f at xo. 
The coming examples compute derivatives of some simple functions. 


Example 9.21 


(a) If f : R > R is a constant function, then f is differentiable and f’(xo) = 0 for 
every xo € R. 

(b) Ifn € N andf : R > Ris such that f(x) = x” for every x € R, then f is 
differentiable, with f’(xo) = nx7—! for every xo € R. 

(c) Ifn € Z is negative and f : R \ {0} — R is given by f(x) = x”, then f is 
differentiable and f’ (xo) = nx}! for every xo € R \ {0}. 


Proof (a) Let’s apply the first equality in (9.13): letting f(x) = c for every x € R, 
we get 


Pe) ste in — = ee 
X>X0 X— X09 x>xX0 X — XQ X>X0 
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(b) Let’s apply the second equality in (9.13): by Newton’s binomial formula, we 


get 


iva 1 1 n 
eee) = 7 (eo +h)” — x) = i (> ({}ew -*| 


ll 
Pe 
> 
‘Seemann’ 
= 
= 
Hs 
= 
a 
ll 
AY 
& 
+ 
LN 
ne 
= 
| 
> 
= 
T 


Then, successively applying (9.7) and (9.6), we obtain 
/ 4: —1 “.(n —kpk-1 
k=2 
= mi + So him ((;) <0") = nt, 
k=2 


(c) Letn = —m, with m € N. Then, 


fx) —f(%0) | ces =) 


X— Xo X—Xo \x”"™ Xx 


Hence, Proposition 9.7 and (9.6) furnish 


: . 1 x” — xg) ; 1 . x” — xg) 
fo) = — im — = — lim - lim 
x>axo xq LX — XO x>xo XM XG x>x0 \ xX — XO 


Now, applying the result of item (b) (with m in place of n), we get 


1 
YU -_ m=1 —. —m—-1 __ n—1 
f @) = 7 -MXxX9 = —MXp =X. 
0 


oO 


Example 9.22 The sine and cosine functions are differentiable, with sin’ xy = cos xo 


and cos’ x9 = — sin xo for every x9 € R. 


Proof Let’s do the proof for the sine function, the proof for the cosine function 


being completely analogous. 
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It follows from the product formulas of Trigonometry that 


sin(xo +h) —sinxo _ sin 4 h 
FE oO ae : 


A 3 


Since the cosine function is continuous, Proposition 9.7, (9.6) and the fundamental 
trigonometric limit (cf. Lemma 9.11) give 


ac ; sin 4 h 
sin xX) = lim -cos{ xo + = 
h>0 3 2 


2 4: h 
ak lim cos { x) + = 
h>0 5 h>0 2 


. sinh 
= lim — -cosxp = cos Xo. 
hoo A oO 
Remarks 9.23 Before we proceed with the development of the theory, let’s pause to 
make two useful remarks. 


1. Since derivatives are limits, given a function f : [a,b) — R we can consider the 
right-handed derivative of f at a, i.e., the one-sided limit 


Ps in OO 
+ a ’ 


x—at x—-a 


provided it exists. Accordingly, for f : (a,b] — R we can consider the left- 
handed derivative at b, i.e., the limit 


0) = tim SOTO) 


x—>b— x—b 


(also as before, provided such a limit exists). Moreover, whenever we say that 
a function f : [a,b] — R is differentiable, we shall implicitly assume that 
its derivatives at x = a and x = b are one-sided ones, and shall write simply 
f' (a) and f’(b) (instead of f(a) and f’ (b)) to denote them. From now on, such 
conventions will be in force, without further comments. 

2. As in Sect. 9.1, we shall sometimes denote the derivative of a function / at a point 
xo by using Leibniz’s classical notation g (xo), so that 


f@)—FG0) 


x>xX = X— Xo 


The definition of derivative, together with the heuristic discussion of Sect. 9.1, 
allows us to present another important 
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Fig. 9.6 Tangent to the graph 
ofxtre + 


Definition 9.24 Let J be an interval and f : J > R be a function differentiable at 
Xo € I. The tangent line to the graph of f at the point (xo, f(xo0)) is the line that 
passes through this point and has slope f’ (xo). 


If f : 1 > R is differentiable at xo, it follows from the definition above and 
elementary analytic geometry (cf. Chap. 6 of [4], for instance) that the tangent line 
to its graph at (xo, f(xo)) has equation 


y —f(%0) =f’ %o)(% — x0). (9.14) 


Example 9.25 Let H be the portion of the curve xy = 1 contained in the first 
quadrant of a cartesian plane,’ and P and Q be points along the axes, such that 


line PQ is tangent to H at the point A (cf. Fig. 9.6). Prove that: 
(a) AP = AQ. 


(b) If O is the origin of the cartesian plane, then the area of triangle POQ doesn’t 
depend on the position of A along H.. 


Proof It is clear that H coincides with the graph of f : (0,++co) — R, given by 
f@wm= t. If x9 > 0 and A is the point (xo, f(x0)) = (xo, >), then the line tangent 
to G; at A has equation y — ~ = f’ (xo) (x — xo). Example 9.21 gives f’(xo) = —2: 
so that the equation of the tangent line can be written as y — 7 — —aG — Xo) OF, 
which is the same, xpy + x = 2x0. 

Making x = O and then y = 0 in such an equation, we get P(o, 2) and 


Q(2xo, 0), or vice-versa. In any case, items (a) and (b) follow immediately: 


Such a portion is a branch of an equilateral hyperbola (cf. Chap. 6 of [4], for instance), but we 
shall not use this fact. 
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(a) It’s a well know fact (again, see Chap. 6 of [4], for instance) that the midpoint of 
PQ has abscissa and ordinate respectively equal to 5 : = = = and 5 - 2x9 = Xo. 
Hence, such a point coincides with A. 

(b) Since POQ is a right triangle, we have 

2 


——— 
A(POQ) = 5 OP 00 = Ixy = 2. 


oO 


Continuing with the analysis of the concept of derivative, let’s show that 
differentiability is stronger than continuity. 


Proposition 9.26 [f a function f : I —> R is differentiable at x9 € I, then f is 
continuous at Xo. 


Proof Note first that, for x € I \ {xo}, we have 
F) = Flo) + (PAZ) 6x — 
X— Xo 


On the other hand, since we are assuming the existence of lim,,,, forieo) = 
f’ (xo), it follows from Proposition 9.7 that 


Him (0) = fim (Fox) + (=A) 4 — 9) 
xX>Xx9 xXx X— Xo 


= Fo) + fim (A=ZO) 


lim (x — x0) 


x— Xo X—>X0 


= f (Xo) +f’ (%o) 0 = f (Xo). 


Therefore, Proposition 9.4 assures the continuity of f at xo. oO 


The coming example shows that there exist continuous functions which are not 
differentiable in an arbitrary finite set of points of their domains.° 


Example 9.27 Given distinct a}, d2,...,@, € R, the function f : R — R such that 
f= pee |x—a,| is continuous, though not differentiable at any of a1, a2, ..., Gn. 


Proof The given function is obviously continuous. For what is left to do note that, 
for a fixed 1 < k < n, we have 


f(x) = k- al + YO |x ai. 
1<j<n 


i#k 


3Later, in Chap. 11, we shall see an example of a continuous function f : R — R that is not 
differentiable at any point. 
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and in an appropriate neighborhood of a; each summand |x — aj|, with j 4 k, equals 
x — aj or aj — x. Therefore, it suffices to show that x +> |x — a,| is not differentiable 
at dx. 

For the sake of notation, write f(x) = |x —al|, with a € R. Then, for x 4 a, we 
have 


f@-f@ _k-al_{ 1, ifx>a 


x-a x-a -l, ifx<a’ 
Hence, limy—+g+ {@f@ fs) =f @ — { and limy—+q— Baie) = —1, so that Problem 5, 
page 290, assures that lim, fa-fo does not exist. oO 


If 7 C R is an interval and f : J — R is differentiable at each x) € J, we shall 
simply say that f is differentiable function. In this case, the derivative function 
f' : 1 = R of f associates to x € J the derivative f’(x) of f at x. 

If J and J are intervals and f : J — J is a differentiable bijection, the last 
proposition guarantees that f is continuous. Moreover, Theorem 8.35 assures that 
f—! : J Tis also continuous. Our next result explains when f—! is differentiable. 


Theorem 9.28 Let I and J be intervals and f : 1 > J be a differentiable bijection. 
For xq € I and yo = f (xo) € J, we have f—! : J + I differentiable at yo if and only 
if f' (xo) 4 0. Moreover, if this is so, then 


(f Yoo = a5 a (9.15) 


Proof For h sufficiently close to (but different from) 0, let x = x9 +h, 1 = f(xo + 
h) —f (xo) and y = yo + l. Then, y = f(xo) + / = f(xo + A), so that 

h= (xo +h)—x0 =f 0 +) —f7' 00). 
Hence, the continuity of f and f—! guarantee that h > 0 + 1 > 0. Also, notice that 


(fet = fen) (owt p—F on) _ 
i / “hil 


=1. 


Now, suppose that f—! is differentiable at yp. Making h —> 0 (which, as we have 
just seen, is equivalent to making / — 0), it follows from item (b) of Proposition 9.7 
and the computations above that 


ail f (xo +h) —f(x0)\ (fF ' Oo +9 —F 7! (0) 
1 = lim — — 
h>0 h l 
_ a (F000 +h) —f0)\,.. (F-' Wo + D — F700) 
= im (| ———_ } lim |§ —— 
h>0 h 10 1 


= f'(xo)(f-'Y'00). 


Therefore, f’(xo) 4 0 and (f—!)/(yo) = Foy" 
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Conversely, suppose that f’(xo) 4 0. Since 


F700 +) = F700) _ 1 
Fo +h) —F(0))/h" 


item (c) of Proposition 9.7, together with the fact that / > 0 < h — 0, furnishes 


i (Corda ow) a ( I ) 
im | ——_ ]_ = Lim | ————— 
0 i 10 \(F(x0 +h) —f(x0))/h 


1 1 
= lim [ee aoe 
h>0\ (F%o +h) —f%o))/A} — f"o) 
Therefore, f—! is differentiable at yo. B| 


Example 9.29 Letn > 1 be an integer and f : [0,-+co) — [0,+00) be the n-th 
root function, i.e., f(x) = 2/x = x!/". Then, f is not differentiable at x = 0 but is 
differentiable at every x > 0, with f’(x) = 4-x!/""!. 


Proof Recall that f = g~!, where g : [0, +00) — [0, +00) is such that g(y) = y”. 
Hence, it follows from the previous result that f is differentiable at x = g(y) if 
and only if g’(y) 4 0. However, since g/(y) = ny""!, we have f differentiable at 
x = g(y) if and only if y ¥ 0 or, which is the same, x # 0. In this last case, recalling 
thatx = g(y) =y" & y = Ww, (9.15) gives 


1 1 1 1 
f(x) _ — — = .xl/nrl 
sty) nyt nay ton 


oO 


For an important generalization of the previous example, see Problem 4, 
page 310. 

We close this section by presenting the concept of higher order derivatives. To 
this end, let be given an interval / and a function f :/ > R. 

We say that f is twice differentiable at x) € / if there exists a neighborhood 
(xo — r,Xo + r) of xo such that f is differentiable in 7M (xo — r,xo + r) and f’ : 
IN (% —1r,x%0 + 17r) > R is differentiable at xo € /. In this case, we also say that 
(f’)'(xo) is the second derivative of f at xo, and write f’(xo) = (f’)’ (xo). If f is 
twice differentiable at each x9 € J, we shall simply say that f is twice differentiable 
in J, and let f” : J + R denote its second derivative function. 

More generally, let k € N and suppose we have already defined what is meant 
by the k-th derivative f(xy) of f at xo € JI. If f is k times differentiable in 
I (ie., if f (xo) exists for every x9 € 1), we let f : 1 — R denote the k-th 


9.3. Basic Properties of Derivatives 299 


derivative’ function of f. If f is k times differentiable in a neighborhood (x —r, x9 + 
r) of xp and f® : IM (x) —1r,x0 +r) > R is differentiable at xo, then we say that f 
is k + | times differentiable at x), and write 


fe) = FY) 


to denote its (k + 1)-th derivative at xo. 

If f : 1 > Risk times differentiable in J and f“ : J > R is continuous, we 
shall say that f is k times continuously differentiable (or simply continuously 
differentiable, if k = 1) in /. Finally, if f is k times differentiable in J for every 
k EN, then we say that f is infinitely differentiable in /. 


Example 9.30 Ifn € Z\ {0} andf : R — Ris given by f(x) = x", it easily follows 
from item (b) of Example 9.21 and Problem | that f is infinitely differentiable in 
R \ {0} Gn R, if n > 0), with 


f@) =n(n-1)...2—k + 1) 


for every k € N. In particular, if n > 0 and k > n, then f vanishes identically. 


Example 9.31 Functions sin, cos : R — R are infinitely differentiable, with 


(4k4+-2) _ (4k+3) _ 


sin“) — sin, sin’t+) — cos, sin —sin, sin = — cos 


and 


(4k4+2) _ 


cos“) = cos, cost) = — sin, Cos — cos, cos@k+3) = sin, 


for every k EN. 


Problems: Section 9.3 


1. * Let 7 C R be an interval and f : J] > R be differentiable at x) € J. For a fixed 
c € R, prove that: 


(a) Function g : J > R given by g(x) = cf(x) is differentiable at xo, with 
8 (x0) = cf" (0). 

(b) Function h : J > R given by h(x) = f(x) + c is differentiable at x9, with 
h'(xo) = f' (0). 


4Since f is also used to denote the composition of f : J > I with itself, k times, we will rely on 
the context to clear any danger of confusion. 
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The next problem establishes a simple particular case of the chain rule (cf. 
Theorem 9.19). 


. * Let f : (a,b) > R be differentiable at x) € (a, b). For c > 0 (resp. c < 0), let 


g: (4, a) — R (resp. g: (2, a) — R) be the function given by g(x) = f(cx). 
Show that g is differentiable at *2, with g’ (*2) = cf’ (x0). 


Cc 


. * Let J be an interval and k € N. If f,g : J > R are k times differentiable in 


xo € I, prove that f + g and fg are k times differentiable at xo. 


. * In Example 8.37, we defined arcsin : [—1,1] — [-4, 4] and arccos : 


[—1, 1] — [0, x] as the inverses of sin : [-5, 4] > [-1, 1] and cos : [0,7] > 
[—1, 1], respectively. Use Theorem 9.28 to prove that the restrictions of arcsin 


and arccos to the open interval (—1, 1) are differentiable, with 


1 
arcsin’ x = and arccos’ x = — 


V1 — x2 V1 — x2’ 


para todo x € (—1, 1). 


. Compute the indicated derivatives (in each case, assume that the domain of the 


corresponding function is the maximal one): 


(a) f'(); f@) = a. (e) we f(x) = mee 
b) f’(2); — ve (f) f’ (4); £@) = 4cos (4). 
rs a pe - ae (g) f’(4); f(@) = arcsin(2x). 
(d) fl): Ff) = 2x. (h) f’(4); f(&) = 2 arecos(3x). 


. Let f,g : (a,b) > R be differentiable functions whose graphs pass through a 


given point A. We say that these graphs are tangent to each other at A if their 
tangent lines at A coincide. In this respect, do the following items: 


(a) Show that, for m,n € N, the graphs of f(x) = x" and g(x) = x” are tangent 
to each other at (0, 0), but are tangent at (1, 1) only if m =n. 

(b) Let f : (a,b) > (a,b) be a differentiable bijection such that f! is 
also differentiable. If the graphs of f and f—! are tangent to each other at 
A(X, yo), compute the possible values of f’(xo)f’ (yo). 


. Let C R be an interval and f : J > R be continuous at xo ¢€ J. If the function 


x +> xf (x) (« € J) is differentiable at x9, prove that f is also differentiable at xo. 


. * Let J be an open interval, x9 € J and f : J > R be differentiable at x9, with 


f' (xo) = L. Given sequences (d,),>1 and (bn)n>1 in I, with ay < x9 < by, for 
every n => 1, show that 


li f(bn) =f (an) _ 
0 0 0 a 


n>+oo by—dn 


L. 


9.3 


9. 


10. 


11. 


12. 
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Given a differentiable function f : R — R, we say that a real number Xp is a root 
of f provided f(xo) = 0. Newton’s method? for the computation of numerical 
approximations of the roots of f guarantees that, under certain conditions and 
for an appropriate choice of a € R, the sequence (a,),>1 defined by a; = a and 
La) for n > 1 (of course assuming f’(a,) 4 0 for every n > 1) 


f' (an) 
converges to a root of f. In this respect, do the following items: 


Gn+1 = an — 


(a) Show that a,+4, is the point at which the tangent line to the graph of f at 
(an, (dy)) intersects the horizontal axis. 

(b) Given a > 0, explain how Newton’s method make it natural the definition 
of the sequence (a,)n>1 of Problem 19, page 220. 


Given R > 0, let f : (—R, R) — R be given by f(x) = VR? — x2. Prove that: 


(a) The graph of f is the upper semicircle of radius R, centered at the origin of 
the cartesian plane and with diameter along the horizontal axis. 
(b) For xo € (—R, R), we have f’ (xo) = ——2 


(c) Given x) € (—R,R) and A(x, f(xo)), the tangent line to the graph of f, 
obtained according to Definition 9.24, coincides with the straight line that 
passes through A and is perpendicular to the radius OA. 


Let J be an open interval and f : J — R be a differentiable function. Given 
a,b € R, suppose that there exists a straight line r, tangent to the graph of f and 
passing through the point (a, b). If (xo, yo) is the point at which r is tangent to 
the graph of f, show that 


yo =f (Xo) 
yo — b = f'(x0)(X%0 — a) © 


Then, let r and 7’ be the straight lines that pass through the point (1,—1) and 
are tangent to the parabola y = ae use the system above to find the points at 
which r e r’ touch the parabola. 

For the next problem, the reader may find it helpful to read the proof of 
Theorem 6.62 again, as well as to review the elementary facts on conics (cf. 
Chap. 6 of [4], for instance). 

Let P be the parabola with focus F and directrix d. If P is a point on d and 


—_ —_ 
A,B €P are such that AP and BP are tangent to P, show that F € AB. 

For the next problem, the reader may find it convenient to have at his/her 
disposal an extension of the definition of lim,_,,, f(x) to the case of a function 
f : X — R, where X C R is a nonempty set and xo € R is the limit of a 
sequence of distinct elements of X (for instance—and this will essentially be 
the case of our interest —, we can have X = Q and x = fD), Fortunately, 
such an extension is formally identical to the one we have already met: we say 


>We shall have more to say on Newton’s method in Problem 7, page 455. 
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that lim,_,,, f(x) exists and is equal to L if, for every « > 0, there exists 5 > 0 
(depending on €) such that 


x €X and 0 < |x—xo| < 6 > [f(X) -L| <e. 


13. (OIMU) Let f : (0, +00) — R be given by 


_ j)0, ifx¢Q 
EO 4, if x= 2 with p,q €N; ged(p,q) = 1° 


If k € N is not a perfect square, show that f is differentiable at x = /k. 


9.4 Computing Derivatives 


This section is devoted to the presentation of the usual differentiation rules, 1.e., 
formulas that show how to compute derivatives of certain functions constructed in 
terms of two given differentiable functions. 

For item (c) of the coming proposition, we recall that if g : J > R is 
differentiable at x) € J, then g is continuous at x9; therefore, if g(xo) ~ O, then 
there exists an interval J C J, containing xo and such that g 4 0 in J. Thus, we think 
of f as defined in such a J. 


Proposition 9.32 If f,g: I — R are differentiable at x9 € I, then: 


(a) f + g is differentiable at xo, with (f + g)' (xo) =f" (xo) + g' (x0). 
(b) fg is differentiable at xo, with (fg) (xo) = f' (x0) g (x0) +f 0) g" (x0). 
(c) If g(xo) # 9, then © is differentiable at xo, with 


f\ ... _ f’ Gogo) —fX0) 8 Xo) 
+) @ = eee 
& 8 (Xo) 
Proof 
(a) Let us prove that f + g is differentiable at xo, with (f + g)/(xo) =f’ (x0) + 2’ (0) 
(the claims relative to f — g can be proved in quite analogous ways). Since 
(f+ 8)@) =O + 8)(%0) _ f@) =f 0) | 8) = 80) 


X— Xo X— X09 X— Xo 


for x € I \ {xo}, the arithmetic properties on limits of functions guarantee that, 
if f’ (xo) and g’ (xo) exist, then (f + g)/(xo) also does and 
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(eG = in 2 Ow 


xX—>x0 xX — X09 
= lim (= — f (xo) m B(x) = sv) 
x>xo Xx —X9 x— Xo 


ait f(x) — fo) tice g(x) — g(x) 


xX—>Xx0 X= Xo X>X0 xX — X09 
= f' (x0) + 8’ (0). 


(b) Firstly, note that 


(fg)() — (fg) (0) _ (“ =/) (& = 80) 
=r = (Jew + ( 


X—X0 


) £60) (9.16) 


xX —X0 X— Xo 


Now, since g is differentiable at xo, it follows from Proposition 9.26 that g is 
continuous at xo, so that 


lim g(x) = g(%o). 


Therefore, letting x —> xo in (9.16) and applying the arithmetic properties on 
limits of functions, we get 


fim EICOV= PCD) jig FOV= LOO on (a 
x—>x0 xXx— Xo X—>Xx0 x— Xo xX—>x0 
(x) — 80) 


+ f(xo)- lim 
xx X— XO 


=f (X0)g (xo) + f(%0)g’ Bo). 


(c) Let us first consider the case in which f is constant and equal to 1, and let us 
/ / 
show that (4) (xo) = =e Since g(x) # 0 for x € J C J, we have for 
x € J \ {xo} that 


—(z- : )-- (=). 

x—x0 \g(x) — g(Xo) x— Xo g(x) 8 (xo) 

From the equality above and invoking the differentiability (and continuity) of g 
at xo, we get 


ae ( : ! ) (xo) 
————————— — xX 2 aie 
x>x0 x—X9 (g(x) —_g(X0) a (x0)? 
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For the general case, we use the particular case above, together with the result 


of (b): 
(4) a= (r- ~) Gi) =F) (<) eiiges (=) oy 
& & & & 


£0) _ pe), 8 G0) _ F Goda) = Fla0)s' eo) 
a0) BQ)? (Xo) 


Example 9.33 Let f : R — R be the polynomial function given by 
f(x) = nx” + an x! ++++ + ax + ao, 


with ao, d1,...,@n € Randa, 4 0. Applying Example 9.21 and item (a) of the 
previous proposition several times, we conclude that f is differentiable, with 


f@= Ndnx” | + (n— Dap" ++-++2a.x+ a, 


for every x € R. Then, f’ : R — R is also polynomial, and an easy inductive 
argument shows that f is indeed infinitely differentiable, with ft) vanishing 
identically. 


For the next example, recall from Trigonometry that the secant function sec: 
R\ {5 +k; k € Z} — Ris given by secx = —. for every real x in its domain. 


Example 9.34 If D= {5 +k; k € Z}, then the tangent function tan: R\D > R 
is differentiable, with tan’ x = sec” x for every x € R \ D. Indeed, item (c) of the 
previous proposition, together with the formulae deduced in Example 9.22, furnish 


, _ sin’xcosx—sinxcos’x  cos?x + sin? x i P 
cos? x cos? x cos? x 


We now show that the arc-tangent function (cf. Example 8.37) is differentiable 
and compute its derivative. 


Example 9.35 Function arctan : R > (—4%, 3) is differentiable, with 


arctan’ y = 
eS T+? 


for every ye R. 


Proof First of all, observe that tan’ x = sec? x # 0 for every x € (4. z), Hence, 
it follows from Theorem 9.28 that arctan is differentiable. Moreover, if y € R and 
x = arctany, then y = tan x and, thanks to (9.15) and the result of the former 


example, we get 
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1 1 1 


arctan’ y = ——— = ——— = ——__ = —___. 
y= ‘tan’x  sec2x l+tan?x 1+y 


oO 


The most important of all differentiation rules is the chain rule. Loosely 
speaking, it establishes the differentiability of the composite of two differentiable 
functions, and shows how to compute such a derivative. 

More precisely, let J and J be intervals and g : 1 > J andf : J — I be given 
functions, with g differentiable at x) € J and f differentiable at yo = g(xo) € J. 
Suppose that, in some neighborhood J MN (x) — r, x9 +r) of xo, equation g(x) = yo 
has xo as its only root. Then, for x € 7M (xp — r, xo +r), we can safely write 


(fo 9G) — Fe gio) _ M8) ~ f(g) | 8@) — go) 


x — Xo g(x) — g(%o) x — Xo 


Since g is continuous at xo, we get lim,+x) g(x) = g(%o), so that 


fan LoME= Veale) _ 4. Hels) Fel) 5. 6) — 00) 
xX>x0 X— Xo xX>x0 g(x) = g (x0) xX>X0 X— Xo 
= tim LOL) 95) = Pade") 
yoy =y—yo 
= f'(g(x0))8' (0). 


As we shall see in Theorem 9.19, this formula continues to hold in the general 
case, albeit the proof is much more involving, for, g(x) = yo can have infinitely 
many solutions in any neighborhood of xo (for instance, think of g as the function f 
of Problem 11, and solve the equation g(x) = 0). Hence, we need to develop some 
preliminaries. 

Let J C R be an open interval and f : J > R be differentiable at x9 € J. Set 


IT-x = {he Rxwthelh, 
so that J — xo is an open interval containing 0. Define r : J — x9 > R by letting 
r(h) = f (Xo +h) —f (x0) —f' (xo)h. 
Since f is continuous at x9, we have 


lim r(h) = lim (f(&o +h) ~ f%0) —f' Goh) = 0 = (0), 


so that r is continuous at 0. Also, the definition of f’ (xo) gives 


h>0 h h>0 h 


tim = tim j-— al A F'¢0)) =0. 
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(This last computation, together with r(0) = 0, actually shows that r is differentiable 
at 0 with r’(0) = 0. Nevertheless, we won’t use this fact here.) 
Conversely, we have the following auxiliary result. 


Lemma 9.36 Let I C R be an open interval, f : I > R be a given function and 
xo € I. If there exists a real number L such that the function r : I — x9 — R, given 
by r(h) = f(%o +h) — fo) — LA, satisfies the condition limp+o ae) = 0, then f is 
differentiable at xo, with f’ (x9) = L 


= — feoth)f(xo) rh 
h 


Proof Since — L, the condition lim;,—+o = WM) — dis equivalent to 
h)— 
im 122+ )= Fo) _ L. 


h>0 h oO 


Let J be an open interval and f : J — R be a function continuous at x9 € J. 
For every L € R, the graph of the affine function h + f(xo) + LA passes 
through (x, f(xo)). This being said, it is frequently useful to rephrase the previous 
discussion by saying that f is differentiable at xo if and only if f admits a best affine 
approximation in a neighborhood of xo, in the sense of Lemma 9.36. 

In other words, letting f : J > R be differentiable at x9 € 7, we have 


FX +h) = f%0) +f’ Goh + r(h), (9.17) 


with r : 1—xo — R such that lim;_.o = r() = (. On the other hand, if 1 FS (xo) + Lh 
is an affine approximation of f in a fetakberhoad of x, such that setting r(h) = 
F(xo + A) — fo) — Lh we have lim)_, A = 0, then f is differentiable at x) and 
L= f' (xo). 

We shall sometimes refer to (9.17), valid for all h € I — xo, as Taylor’s formula 
of order 1 for f in a neighborhood of xo. Notice that its real content is captured by 
the fact that lim,9 = Ao = O if f is differentiable at xo. 

For what comes fect, it’s more useful for us to write (9.17) as 


f (xo +h) = f(X0) +f’ Co)h + RUA)A, (9.18) 
with R : 1—xo — R given by 


rh) . 
R(h) = pr Reo 
0, ifh=0 
Notice that the continuity of r implies that of R at every h € I — xo different from 0, 
whereas condition limyp_, a = 0 assures that R is continuous also at 0. 
Conversely, if f can be written as 


f(%o +h) = f(%o) + LA+ R(A)A 
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in J — xo, with R being continuous in J — xp and such that R(O) = 0, then a slight 
modification of the proof of Lemma 9.36 gives that f is differentiable at x9, with 


f'(o) = L. 


We can finally state and prove the chain rule in the general case. 


Theorem 9.37 (Chain Rule) Let J and J be open intervals and g : I — J and 
f : J — R given functions. If g is differentiable at x9 € I and f is differentiable at 
g(xo) € J, then f og: I > R is differentiable at xo, with 


(fF © 8)' (Xo) = f’(g(x0))g" (0). (9.19) 


Proof Let yo = g(xo). According to (9.18), the differentiability of f at yo allows us 
to write 


fo +) =f) +f Oo)t + s(t (9.20) 


for every t € J — yo, with s : J — yp — R continuous and such that s(0) = 0. 
Analogously, the differentiability of g at x9 furnishes, for h € I — xo, 


(xo +h) = g(xo) + 8’ (oh + rh, (9.21) 


with r : J — x9 — R continuous and such that r(0) = 0. 
Now, for h € I — xo, it follows from (9.21) that 


(f 0 g)(xo + h) — (fog) G0) = f(go + A) — f(g) 
= f(g(Xo) + 8 (o)h + rh) — fo) 
=f + t(h)) — fo), 


where t(h) = g’(xo)h + r(h)h. Hence, for h € I — xo, it follows from this and (9.20) 
that 


(fo g)(xo0 + h) — (Fo g)%0) = f(yo + th)) —f 0) 
=f’ (vo)t(h) + s(t(h))t(h) 
= f'(yo)(g' Boh + r(A)h) + s(t(h)) th) 
= f'(g(%0))8' (oh +f Yo)r()h + s(t(h))t(h) 
= f'(8(%0))g’(xo)h +f’ (vo)r(A)h 
+ s(t(h))(g' (xo) + r(h))h 
= (f 0 g)(x0) = f’(g(%0))g’ o)h + R(A)A, 


where R(h) = f’(yo)r(h) + s(t(h))(g" (xo) + r(h)) for h € I — x0. 
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Finally, the discussion at the next to last paragraph before the statement of 
the chain rule guarantees that, in order to show that f o g is differentiable at xo 
and that (9.19) is true, it suffices to prove that R is continuous at J — xo, with 
R(O) = 0. But this is straightforward from the chain rule for continuous functions 
(cf. Proposition 8.12), together with the fact that r, s and f are continuous, with 
r(0) = s(0) = ¢(0) = 0. Oo 


Example 9.38 The chain rule allows us to compute the derivative of the cosine 
function from that of the sine unOn, Indeed, since cosx = (sinog)(x), where 
: R > Ris given by g(x) = 5 — x, it follows from the chain rule that 


cos’ x = (sinog)’(x) = sin’ g(x) - g(x) = cos g(x) - (-1) 


ges 
= —cos{— — = —sinyx. 
2 


Example 9.39 Let R > 0 and f : (—R,R) > R be given by f(x) = VR? — x’. 
Define g : (0, +00) > Randh: (—R,R) > (0, eo) by letting g(x) = ./x and 
h(x) = R? — x’, so that f = goh. Since g'(y) = zy for y > 0, the chain rule gives, 
for x € (—R, R), 


f'@) = (Boh) = B'(h@))'@ = *(—2x) = 


1 x 
2/h(x) Rx 
A faster way of using the chain rule to compute the derivative of f at x € (—R, R) 


is this: first write f(x)? +x? = R?, then differentiate both sides with respect to x (and 
with the aid of the chain rule) to get 2f(x)f’(x) + 2x = 0; hence, f’(x) = — 


x 


[R2— 2° 


oi 
f@ ~~ 


Example 9.40 We can use the chain rule, together with the formula for differen- 
tiation of a product, to establish the differentiation formula for a quotient. To this 
end, let f, g : 1 > R be differentiable in J, with g(x) 4 0 for every x € J. Letting 
t : R \ {0} — R denote the inverse proportionality function, so that t(x) = x~! for 
every X re 0, it follows from item (c) of Example 9.21 that t is differentiable, with 
t'(x) = —x? for every x € R \ {0}. Now, notice that us = f(x) - (t ° g)(x) for 
every x € J, so that the formula for differentiation of a Sroduet and the chain rule 
give 


(5) (x) =f'@)- (Fo g)@) +f@)- (0 8)'@) 


=f’) + f£@)-1'(g@)8'@) 


a 
ara on ; + f)(—e(a))e"(x) 


_ fs) ~ fs" (x) 
g(x)? 
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The coming example is considerably more sophisticated than the previous ones. 
Example 9.41 Prove that f : R > R, given by f(x) = sin(x’), is not periodic. 


Proof First of all, note that if f : R — R is continuously differentiable and periodic, 
then its derivative f’ : R — R, besides being continuous, is also periodic. Indeed, 
if f(x) = f(x + p) for some real p > 0 and every x € R, then the chain rule gives 
f'(x) = f'(«+ p) for every x. Therefore, 


Im(f’) = Im(fi0,,)- 


and the continuity of f’, together with Corollary 8.30, guarantees that f” is a bounded 
function. 

Now, letting f be the given function and g : R > R be given by g(x) = x”, we 
have f(x) = (sinog)(x), so that the chain rule furnishes 


f' (x) = sin’ g(x) - g’ (x) = 2xcos(x’). 


Hence, if f(x) = sin(x*) were periodic, the discussion at the preceding paragraph 
would guarantee that f’ would be bounded. Since this is obviously false, we have 
reached a contradiction. oO 


The following corollary of the chain rule will be of interest later. 


Corollary 9.42 Let I and J be open intervals and g: I > J andf : J > R given 
functions. If g is n times differentiable at xy € I and f is n times differentiable at 
g(xo) € J, then f og: I > Risn times differentiable at xo. 


Proof We make induction on n, relying on the chain rule for the case n = 1. Assume 
the corollary to be true when n = k, and let f and g be k + 1 times differentiable 
at Xo. 

In a neighborhood of x9, we have (f 0 g)’ = (f’ 0 g)g’, with f’ and g’ being k 
times differentiable at that point. The induction hypothesis guarantees that f’ 0 g is k 
times differentiable at xo, from where Problem 3, page 300 (applied to f’ o g and g’) 
assures that (f o g)’ = (f’ 0 g)g’ is also k times differentiable at xp. Therefore, f © g 
is k + 1 times differentiable at xo. oO 


Problems: Section 9.4 


1. Compute the derivatives of the given functions, assuming in each case that the 
function and its derivative are defined in their maximal domains: 


(a) f(x) = 23 — 2x2 47x +1. ©) f=4+4+4 
(b) f(x) = xe/a + x4. (d) f(x) = cosx + tanx. 
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(e) f(x) = 2° sin x. () f@) = SH 
(f) f%) = 7./xX" + 4). (h) f(x) = Es. 


. Let J be an interval and f : J + (0,+00) be differentiable at x9 € J. If g 


I —> R is given by g(x) = ~/f(x), show that g is differentiable at xo, with 
Le 
8'(xo) = fo)" fC). 


. Compute the derivatives of the given functions, assuming in each case that the 


function and its derivative are defined in their maximal domains: 


(a) f(x) = (cos.x)?. (e) f(x) = aresin(1 — x”). 
(b) f(x) = cos(x). (f) f(@®) = ence) 
Chie ane we 

(d) fQ@%) = v1 —xcos(x’). (h) f(x) = CCE 


. Letr be a nonzero rational number andf : (0, +00) — R be given by f(x) = x’. 


Prove that f is differentiable, with f’(x) = rx’—! for every x > 0. 

For the next problem we recall a few facts on roots of polynomial functions. 
Given a polynomial function f with real coefficients and degree n > 1, anda 
real root a of f, the division algorithm for polynomials (cf. Chap. 14 of [5]) 
guarantees the existence of an integer 1 < k < n and a polynomial g of degree 
n — k such that 


F(x) = (x— a)‘ g(x), (9.22) 


with g(a) # 0. In this case, we say that & is the multiplicity of a as a root of f. 


. Let f : R — R be a nonconstant polynomial function. Show that: 


(a) If @ is a root of f and f(x) = (x — a)*g(x), with g(a) ¥ 0, then, for every 
x € R such that f(x) 4 0, we have 
ra) k_, ¥@ 
iQ) a-a eG) 


(b) If f(x) = aw —a))(x— a2)...(%—@,), with a,a),...,@, € Randa ¥ 0, 
then, for x # a@,...,@n, we have 


f'@) _ “. 1 
f(x) Perr 


j=l 


. Iff : R \ {3} > Ris given by f(x) = =, compute f© (0). 
. Let f : [0, +00) > R be given by 


x= 3:7 


f(x) = xt Yuxtee + Vxtvx41 


with ten square root signs. Compute f’(0). 


9.4 


10. 


11. 


12. 
13. 


14. 
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. Fork €N, show that arctan (x) = iQ) where Px is a polynomial of degree 


C+2)F? 
k — 1 and leading coefficient (—1)*~!k!. 


. Find the common tangents to the parabolas of equations y = x? and y = 2 + 


(x — 3)’. 

For the next problem, the reader may find it helpful to review the elementary 
material on conics, at the level of Chap. 6 of [4], for instance. 
Let € be the ellipse of equation = + y= = | and H the hyperbola of equation 


2 
2 


a x = |. We say that € and H are confocal if they have the same foci. In 
this respect, show that: 


(a) a? -P =a? +b’. 
(b) € and H intersect in four distinct points; moreover, if (xo, yo) is one of them, 
then 


2 1 1 _ 1 1 2 1 1 _ 1 1 
Xo ab? as a2b2 } be - b2 and Yo ab + a2b2) qe i ae 


(c) If P is one of the intersection points of item (b), then the straight lines 
tangent to € and H at P are perpendicular. 


Do the following items: 


(a) If f : R > Ris given by 


x? sin + 1 ifx#0 
0: 4x =0 


’ 


f(x) = 


show that it is differentiable at every real number and that f’ is not 
continuous at 0. 

(b) For real numbers x; < x. < ... < X,, give an example of a differentiable 
function g : R > R such that g’ is discontinuous at x, x2, ..., Xp. 


Prove that f : (0, +co) > R, given by f(x) = sin ./x, is not periodic. 
(Putnam) Let ay, do, ..., d, be given reals and f : R — R be such that f(x) = 
a, Sinx + a2 sin(2x) + --- + a, sin(nx) for every x € R. If |f(@)| < | sinx| for 
every x € R, prove that |a; + 2a2 +---+na,| < 1. 

(OBMU) Given nonzero real numbers a1, d2,..., @n, Show that the function f : 
R — R, such that 


f(x) = > ajcos(jx) 
j=1 


for every x € R, has period 27. 
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15. * Prove the following version of l’H6pital’s rule®: let f, g : [a,b) > R be two 
functions differentiable at a and such that f(a) = g(a) = 0. If g # 0 in (a,b) 
and g’(a) 4 0, show that 


aml. -£@. 
ag@) g(a) 


9.5 Rolle’s Theorem and Applications 


Let f : [a,b] — R be continuous in [a, b], differentiable in (a,b) and such that 
f(a) = f(b) = 0. In addition, suppose that f is nonzero in at least one point of the 
interval (a, b). Arguing heuristically, translate the horizontal axis parallel to itself, 
until it becomes tangent to the graph of f at a point (c,f(c)) (cf. Fig. 9.7). Then, on 
the one hand, such a tangent line must have slope f’(c); on the other, being parallel 
to the horizontal axis, its slope must be equal to 0, so that f’(c) = 0. 

The first result of this section, which is known in the literature as R6lle’s 
theorem,’ puts the discussion of the previous paragraph in solid grounds. 


Lemma 9.43 (Rolle) Let f : [a,b] — R be continuous in [a, b] and differentiable 
in (a, b). If f(a) = f(b) = 0, then there exists c € (a,b) such that f’(c) = 0. 


Proof Let c,d € [a, b] be the points in which f attains its minimum and maximum 
values, respectively. If c,d € {a,b}, then for every x € [a,b] we have 0 = f(c) < 
f(x) < f(d) = 0. Therefore, f vanishes identically in [a, b] and there is nothing left 
to do. 

Else, suppose that c € (a,b) (the case of d € (a,b) can be dealt with 
analogously). Since f(x) > f(c) for every x € [a, b], we conclude that 


Fig. 9.7  Rélle’s theorem 


® After Guillaume F. Antoine, Marquis de I’Hépital, French mathematician of the XVII century. A 
more refined version of 1’ H6pital’s rule will be presented at Proposition 9.50. 


7 After Michel R6lle, French mathematician of the XVII century. 
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x € (c, b] => ——— = 0 and Pe, <0. 
a x-—C 


f@)—-fO 
Cc 


Hence, it follows from the results of Problems 2 and 5, page 290, that 


fice. pes TOA O) 
f(c) = lim = 


x—>c+ x 


> 0 and f’(c) = lim feO=fo) <0. 


>c- XC 
Thus, f’(c) = 0. Oo 
The next example shows a typical application of Rélle’s theorem. 


Example 9.44 Given a,b,c € R, show that the equation 4ax? + 3bx? + 2cx = 
a+b+chas at least one real root between 0 and 1. 


Proof First of all, note that a standard application of the IVT is not conclusive, 
for, if f(x) = 4ax? + 3bx? + 2cx — (a+b +c), then f(0) = —(a + b + c) and 
f() = 3a+ 2b +c, so that f(0) and f(1) can have equal signs (for instance, for 
a= land b = c = —3). 

On the other hand, if f(x) = ax*+bx? +cx?—(a+b+c)x, then f(0) = f(1) = 0. 
Hence, Rélle’s theorem assures the existence of xy € (0, 1) such that f’(xo) = 0. 
Now, note that this is the same as saying that xo is a root of the given equation. O 


Let’s (heuristically) analyse a situation more general than that of Rélle’s theorem. 
More precisely, let’s consider again a function f : [a,b] > R, continuous in [a, 5] 
and differentiable in (a,b), but such that f(a) and f(b) can assume any values 
whatsoever. The slope of the secant line to the graph of f passing through (a, f(a)) 
and (b, f(b)) equals Hehe We translate it parallel to itself until it becomes tangent 
to the graph at a point (c,f(c)), with c € (a, b) (cf. Fig. 9.8). Then, on the one hand, 
the slope of this tangent line equals f’(c); on the other, since it is parallel to the 
original secant line, its slope equals that of the secant, so that 


f(b) -f@) 


fo=— = 


This result is the content of the mean value theorem (MVT, shortly) of 
Lagrange, and it is the content of the coming result. 


Theorem 9.45 (Lagrange) Jf f : [a,b] — R is continuous in [a,b] and 
differentiable in (a, b), then there exists c € (a,b) such that 


f(b) =f@) _ 


=O). 


Proof Let g : [a,b] > R be given by 


ea) = 10) = ((F=*) A + (E*)ro), 
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Fig. 9.8 Lagrange’s mean 
value theorem 


for every x € [a, b]. (Note that the function of x between parentheses is precisely the 
one that defines the secant to the graph of f passing through (a, f(a)) and (b, f(b)).) 

Clearly, g is continuous in [a, b] and differentiable in (a, b), with g(a) = g(b) = 0. 
Hence, by Rélle’s theorem, there exists c € (a, b) such that g’(c) = 0. Now, an easy 
computation gives 


b 
vm=ro- (FOr), 
—-a b-a 
so that 
b 
b=<20=7O= f(b) — f(a) ) LQ 
—a oO 
Example 9.46 For x,y € (—3, 4), show that | tanx— tan y| > |x—yJ. 


Proof If x = y, there is nothing to do. Otherwise, it follows from the MVT of 
Lagrange the existence of a real c between x and y, such that 


tanx — tany = (tanc)(x—y) = ae y). 


However, since | cosc| < 1, we get 


| tanx —tany| = 


1 
7, *- yl 2 lel. 


The following corollary sets an important consequence of Lagrange’s MVT. 


Corollary 9.47 Let I C R be an interval and f,g : I > R be two differentiable 
functions. If f’ = g', then f — g is constant. 
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Proof Since f’ = g’ if and only if (f — g)’ = 0, it suffices to show that f’ = 0 
implies f constant. 

Let f’ = 0 and fix a < bin J. Since f is differentiable in J, it is certainly 
continuous in [a, b] and differentiable in (a, b). Therefore, Lagrange’s MVT gives 
c € (a,b) such that “8 — #"(c) = 0. Hence, f(b) = f(a), and since a and b 
were arbitrarily chosen in J, it follows that f is constant. Oo 


Example 9.48 If f,g : R — R are differentiable functions such that f’(x) = g(x) 
and g’(x) = —f(x) for every x € R, then f(x) = f(0) cosx + g(0) sinx and g(x) = 
g(0) cosx —f(0) sinx, for every x € R. 


Proof Letting h(x) = f(x) sinx + g(x) cosx and I(x) = f(x) cosx — g(x) sinx, we 
have h(0) = g(0), /(0) = f(0) and 
h'(x) =f’ (x) sinx + f(x) cosx + g@'(x) cosx — g(x) sinx 
= (f'(x) — g(x) sinx + (f(x) + g'(@)) cosx = 0. 


Analogously, /'(x) = 0 for every x € R, so that the previous corollary guarantees 
that / and / are constant functions. Hence, 


f(x) sinx + g(x) cosx = g(0) 
f(x) cosx — g(x) sinx = f(0) © 


The above inequalities can be seen as a linear system of two equations in the 
unknowns f(x) and g(x). We can easily solve it to get f(x) = f(0) cosx + g(0) sinx 
and g(x) = g(0) cosx — f(0) sinx. Oo 


In what comes next, we discuss an important generalization of Lagrange’s MVT 
to two given functions, known as Cauchy’s mean value theorem. Note that (9.23) 
reduces to the MVT of Lagrange if g(x) = x. 


Theorem 9.49 (Cauchy) Jf f,g : [a,b] — R are continuous in [a,b] and 
differentiable in (a, b), then there exists c € (a,b) such that 


(f(b) —f(a))g’(c) = (8b) — g(@)f')- (9.23) 


Proof Leth: [a,b] > R be given by 


A(x) = F(b) —f(@)s@ — (gb) — g(@) fd, 


for every x € [a, b]. It is immediate that / is continuous in [a, b] and differentiable 
in (a, b), so that, by Lagrange’s MVT, there exists c € (a, b) satisfying 


h(b) — h(a) 


i ae a 


316 9 Limits and Derivatives 


Now, since h(a) = h(b) = f(b)g(a)—f(a)g(b), we have h’(c) = 0 and it suffices 
to observe that 


h'(c) = (f(b) —f(@))g' (©) — (gb) — g(@)f'(). q 
In Calculus, it is often the case we have continuous functions f, g : [a,b) > R 
such that f(a) = g(a) = 0, and we need to evaluate (if it exists) the limit 


lim fx) 

x—a g(x) 
In general, we refer to such a quotient as an indeterminacy of the form 2 asx =a. 
Problem 15, page 311, taught us how to deal with it when f and g are differentiable 
at a, with g’(a) 4 0. Cauchy’s MVT allows us to get a more refined version of that 
result, which is also known in the literature as l’H6pital’s rule. 


Proposition 9.50 (l’H6pital’s rule for 7) Let f,g : (a,b) > R be differentiable 
functions such that limy+af (x) = limy+a g(x) = 0. If g, g’ € 0 in (a, b), then 


7 
m2) _ 5 = tim iy, 
xa gi (x) x4 g(x) 
Proof Start by continuously extending f and g to [a, b), setting f(a) = g(a) = 0. 
For x € (a, b), it follows from Cauchy’s MVT that 


f&) _ f@M-f@ _ fice) 


g(x) g(x)— g(a) _—_g/(c(x))’ 


for some c(x) € (a,x); in particular, we clearly have lim,_., c(x) = a. 


Nov, if lim,+, ce = L, a slight modification in the hint given to Problem 10, 


(9.24) 


page 291, guarantees that lim,_,, £ co = L. Hence, it readily follows from (9.24) 


that lim,+, a =[. oO 


Example 9.51 We use I’ H6pital’s rule for 2 to compute lim,_,9 a. (Note that 
x t /x is not differentiable at x = 0, so that we cannot use the result of 
Problem 15, page 311). Indeed, both f(x) = 1 — cosx and g(x) = ./x (for x > 0) 
satisfy the hypotheses of the previous proposition and are such that 


_ fd) ; sin x . : 

lim = lim ——*~ = lim? = 

20g) x20 1/Qgx) 250 vasinx = 0 
@) 


Therefore, lim,_+o a = 0. 
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Problems: Section 9.5 


ie 


10. 


11. 


. Show that the equation x 
. Redo Problem 8, page 300, this time under the assumption that f : J > R is 


Let f : R > R be a twice differentiable function, such that f” (x) + f(x) = 0 
for every x € R. Prove that 


f(x) = f(0) cos x + f’(0) sinx, 


for every x € R. 


. Leta > Oandf : R > R be twice differentiable and such that f(x) + Af(x) = 


0 for every x € R. Prove that 


F(x) = f(0) cos(V/Xx) + oS aneen: 


for every x € R. 


. * Let J be an interval and f : J — R be a twice differentiable function, such 


that f” is constant. Prove that f either vanishes identically or is a polynomial 
function of degree at most 2. More precisely, if x9 € 7, show that 


(ohare wat SO) — xa) 


. (OIMU) Find all functions f : R — R such that (f(x) — f(y)? < |x — y/, for 


every x,y ER. 


. Let C R be an interval and c € J. Let f : J > R be a function continuous in / 


and differentiable in J \ {c}. If lim,..f’(x) = J, prove that f is differentiable in 
c, with f’(c) = 1. 


. (Putnam) Let ao, a1,..., 4, be real numbers such that ¢ + 4 +--- + = 0. 


n+l 
Show that the palgioinial function ap + ayx +--+ + oe has at least one real 


root. 


? = xsinx + cosx has exactly two real roots. 


continuously differentiable in /. 


. In each item below, compute the given limit: 


(a) lim,_5o a (c) limy-+0 feos ees 
(b) lim,+9 —¥2* (d) lim,+—1 sin? (x) 


—1 G4?" 


Let f, g : (0, £) > R be given by f(x) = x°/ * sin = + and g(x) = = x'/3—cosx+1. 
Compute, if it exists, lim,—o at 
If f : (a,b) > R is twice differentiable, show that, for every xo € (a, b), 


fm LOO +) + Fo — hy — 2F 0) 
1. fe 
h>0 h2 


= f" (xo). 
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12. Let f : [a,b) — R be twice continuously differentiable in [a, b), with f(a) = 0 
and f(a) 4 0. 


(a) Show that there exists c € (a, b) such that f(x) — (x—a)f’(x) 4 0, for every 
x € (a,c). 
(b) Compute lim,_,, mes in terms of f(a), f’(a) and f”(a) (item (a) 
assures that the denominator doesn’t vanish in the interval (a, c)). 
13. * Prove the following version of |’ H6pital’s rule for indeterminacies 3 at +00: 
leta > Oandf, g : (a, +00) — R be differentiable functions such that g, g’ 4 0 
in (a, +00). If limy-+ +00 f(x) = limy-++00 g(x) = 0, then 


lim POY as lim fo) _ 7 


X—>+00 g! (x) ~ xX—>+00 g(x) 


14. Let f : R — R be a differentiable function such that lim,++o0f(x) and 
lim,— +00 xf” (x) exist and are finite. Compute the possible values of this second 
limit. 

15. Prove the following version of I’H6pital’s rule for indeterminacies =& at a € 

R: let f, g : (a,b) — R be differentiable functions such that g, g’ 4 0 in (a, b) 

and lim,_,¢ g(x) = +00 (or —oo). Then, 


iC ee ie 


Si, 
xa gi(x) x4 g(x) 


16. The differentiable function f : IR — R is such that |f’(x)| < c < 1 for every real 
x, where c is a positive real constant. Show that there exists only one x) € R 
such that f(xo) = Xo. 

17. * Let 7 C R be an open interval, x) € J and f : J ~ Ra continuous function, 
which is differentiable in J \ {xo}. If there exists L = lim,—,, f’(x), prove that f 
is differentiable at xo, with f’(xo) = L. 

18. (Romania) Prove that there does not exist a differentiable function f 
(0,+00) — (0,+00) such that f(x)? > f(x + y)((x) + y) for every 
x,y € (0, +00). 

19. (OBMU) Find all differentiable functions f : R — R such that f(0) = 0 and 
If’ (x)| < [f(x)| for every x € R. 


9.6 The First Variation of a Function 


We saw in Sect.8.2 that every continuous function f : [a,b] — R attains 
extreme (i.e., maximum and minimum) values in [a, b]. In this section, we shall 
discuss procedures that, among other things, allow us to effectively compute the 
corresponding extreme points of such an f, provided it is differentiable in (a, b). 
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In all that follows, 7 denotes an interval, whose interior is the interval obtained 
by excluding its endpoints, if they exist. Thus, if 7 = [a, b], [a, b), (a, b] or (a, b), 
then the interior of J is the open interval (a,b); analogously, if J = [a, +00) or 
(a, +00), its interior is (a, +00), and if J = (—oo,b] or (—ov, D), its interior is 
(—oo, b). 

Broadening the situation of the first paragraph above, our purpose in this section 
is to solve the problem of finding extreme points and values of functions f : J > R 
which are continuous in / and differentiable in the interior of J. To this end, we begin 
with the following 


Definition 9.52 Givena functionf : J > R, we say that x) € J is alocal maximum 
(resp. local minimum) point for f if there exists 5 > 0 such that f(xo) > f(x) (resp. 
Ff (xo) < f()), for every x € TM (xo — 6, xo + 8). 


In Fig.9.9, points xo, x, and xj are of local minimum for the function 
f : (a,b) > R whose graph is depicted. Observe that, in a neighborhood of each 
of these points, the values of f are no less than those it assumes at x0, Xo and XG 
(as indicated by the three dashed horizontal line segments). Also, x4 is the only 
global minimum point of f (i-e., a point in which f attains the minimum possible 
value in (a, b)), whereas xo and xj are not global minimum points. Finally, notice 
that f has also two maximum local points (identify them!), albeit none of which is 
a global maximum point. 

Generically, local maximum or minimum points of f : J — R are called its 
extreme points, or simply its extrema. If / is an open interval and f : J] > R is 
differentiable, we shall show below that its extrema (if any) can be found among the 
points where f’ vanishes. Since these vanishing points will play quite an important 
role in subsequent discussions, it is worth to start by naming them. 


Definition 9.53 Let J C R be an open interval and f : J — R a differentiable 
function. We say that xp € J is a critical point of f if f’ (xo) = 0. 


Fig. 9.9 Local minimum points of f: J > R 
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The notion of critical point has an obvious geometric interpretation: letting J be 
an open interval, it follows from (9.14) that the critical points of a differentiable 
function f : / — R are precisely those points in which the tangent lines to its graph 
are horizontal. 

We can now state and prove the following fundamental result, which is known 
as the first derivative test for extreme points. Although its proof is quite similar to 
that of R6lle’s theorem, we repeat it here for the sake of completeness. 


Proposition 9.54 [f I C R is an open interval and f : I > R is a differentiable 
function, then every extreme point of f is also critical. 


Proof Assume that x9 € J is a local minimum point of f (the proof in the other case 
is completely analogous). Take 6 > 0 such that (xo — 6,.xo +6) C J (since J is open, 
such a 6 always exists) and 


0 < |x—x01 <5 > FO) — f(x) = 0. 


For xp < x < x9 + 6 we have fo) > 0, so that 


fi) = im FOF) , 9 


X—>x9+ x— Xo 


An analogous reasoning with xy — 6 < x < x9 gives us 


f'Go) = tim FO -fEO) - 9 


xX>x0- x—Xo 


Therefore, f’ (xo) = 0. Oo 


As a corollary to the first derivative test, we have the following search criterion 
for extreme points. 


Corollary 9.55 Let f : 1 — R be continuous in I and differentiable in the interior 
of I. If f attains a global (maximum or minimum) value in I, then the corresponding 
extreme point is either an endpoint of I or a critical point of f. 


Proof Suppose that f attains a global minimum value in J (the case of a global 
maximum value can be treated analogously). Let x9 € J be such that f(x) = 
min{f(x); x € J}. If xo is an endpoint of /, there is nothing left to do. Otherwise, 
xo belongs to the interior of J; however, since the interior de J is an open interval, 
the first derivative test assures that xo is a critical point of f. Oo 


The coming example makes it clear that the above corollary solves the problem 
of finding the global extreme values of a continuous function f : [a,b] > R, 
differentiable in (a, b), as long as we know how to find the roots of the equation 


f'(x) = 0. 
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Example 9.56 Find the maximum value of f : [0, 1] > R, given by f(x) = x — x". 


Solution Since f is continuous in [0, 1], we know from Theorem 8.26 that f assumes 
a maximum value in [0, 1]. By the previous corollary, the corresponding maximum 
point is 0, 1 or a critical point of f. Now, since f’(x) = 1 — 4x°, we conclude that 

1 . “ae . . 1 a 3 . . = — 
zy 18 the only critical point of f, with f (=q) =a Finally, since f(0) = f(1) = 
(== 


ri 

Our next result, a direct consequence of Lagrange’s MVT, teaches us how to get 
the monotonicity intervals of a differentiable function. From now on, we shall refer 
to it as the study of the first variation of a differentiable function.® 


, it follows that the maximum value of f is Ht Oo 


Proposition 9.57 [ff : I — R is continuous in I and differentiable in the interior 
of I, then: 


(a) f’ => 0 in the interior of I if and only if f is nondecreasing in I. 

(b) If f’ > 0 in the interior of I, then f is increasing in I, but the converse is not 
necessarily true. 

(c) f’ < 0 in the interior of I if and only if f is nonincreasing in I. 

(d) If f’ < 0 in the interior of I, then f is decreasing in I, but the converse is not 
necessarily true. 


Proof Let’s prove only items (a) and (b), the proofs of items (c) and (d) being 
entirely analogous. 


(a) Suppose first that f is nondecreasing in J. For a given xo in the interior of /, 
take 6 > 0 such that (xp — 6,x9 + 6) C I. If xo < x < xo + 4, then x € J and 
F(x) —f(@o) = 0. Hence, fonrco) > 0 and, making x > xo+, we get 


deja tg LOO). 


xX>xo+ X— Xo 


Conversely, suppose f’ > 0 in the interior of J. If a,b € I are such that 
a < b, the continuity of f in J guarantees its continuity in [a, b]. On the other 
hand, since (a, b) is contained in the interior of J, we have f differentiable in 
(a, b), so that Lagrange’s MVT guarantees the existence of c € (a, b) for which 


f(b) —f(@) 


a HS) 20. 


Therefore, f(b) > f(a), and f is nondecreasing in /. 
(b) Assume that f’ > 0 in the interior of J, and let a < b be two points of J. Again 
by Lagrange’s MVT, there exists c € (a, b) for which 


8This terminology alludes to the key role of the first derivative in this result, as well as to the fact 
that, classically, the first derivative of a (differentiable) function was called its first variation. 


322 9 Limits and Derivatives 


b-a 


Hence, f(b) > f(a) and f is increasing in J. 


To see that the converse is not generally true, let f : R — R be given by 
f(x) = x°. Then, f is increasing in the whole real line, albeit f’(0) = 0. Oo 


The coming example shows how to use Proposition 9.57 to approach problems 
involving maxima and minima of differentiable functions. 


Example 9.58 Let f : [0, +00) > R be given by f(x) = a Show that f attains 


a global minimum in [0, +00) and compute this minimum value. 


Solution If f is to attain a global minimum in [0,+00), then Corollary 9.55 
guarantees that the corresponding extreme point is either 0 or a critical point of 
f. Computing f’, we get 


2x(x + 1) — (x? + 3)- 1 _* Fa 3 


PO" ele IF 


so that f’(x) = 0 if and only if x = 1 (recall that we must have x > 0). 

On the other hand, since x7 + 2x — 3 = (x + 3)(x — 1), we conclude that 
f’ is negative in the interval (0,1) and positive in the interval (1, +00). Hence, 
Proposition 9.57 says that f decreases in [0, 1] and increases in [1, +00), so that it 
really attains a global minimum, at x = 1. Therefore, the minimum value of f is 


f() =2. o 


We now apply the first variation of functions to give a third proof of the inequality 
between the arithmetic and geometric means. 


Example 9.59 Given an integer n > | and positive reals a1, a2,...,a, we have 


ay + ay +++ +a 
n 


= x/a1a2...an, 


with equality if and only if a; = az = +++ = dy. 


Proof Let’s make induction on n > 1, relying in the discussion that led to (5.2) for 
the case n = 2. Given an integer k > 2, suppose we have established the inequality 
for k — 1 arbitrary positive real numbers, with equality if and only if they are all 
equal. 

Given k positive reals aj,...,ax—1, ax, let f : (0, +00) — R be defined for 
x > Oby 


f(x) = ay tees Fag + x— Keay... a1 X. 


We shall show that f(x) > 0 for every x > 0, with equality if and only if x = a; 
and a; = --- = ag_1; in particular, it will follow that f(a,) > 0, with equality if and 
only if a; = --- = ay_, = ay, as wished. To this end, note that f is differentiable, 
with 


9.6 The First Variation of a Function 323 


f@®=1-Yq... Geax, 


Hence, letting x» = 'V/aq,..- a1, we have f’ < 0 in (0, x0), f’(xo) = 0 and f’ > 0 
in (xo, +00), so that Proposition 9.57 assures f to decrease in (0, x9] and increase in 
[xo, +00). Therefore, f attains a global minimum (only) at x = Xp. 

Finally, a straightforward substitution gives 


f(%o) = ay ++ + ay — (k— 1) Yay. a1, (9.25) 


so that f(xo) = 0 by induction hypothesis. Thus, 


f(x) = fo) = 0 


for every x > 0, with equality if and only if x = x9 and f(xo) = 0. It now suffices 
to observe that, thanks to (9.25) and the induction hypothesis, we have f(xo) = 0 if 
and only if aj = ... = ay_4. oO 


We shall sometimes use the first variation in the form of the following 


Corollary 9.60 Let I be an open interval, f : I > R be twice differentiable and 
xo € I be acritical point of f. If f” > 0 (resp. f"” <0) in I, then xo is the only global 
minimum (resp. maximum) point of f. In particular, xo is the only critical point of f. 


Proof Suppose f” > 0 in I (the case f” < 0 in / is totally analogous). Since f” = 
(f’)’, item (b) of Proposition 9.57 (applied to f’, in place of f) guarantees that f’ 
increases in J. However, since f’(xo) = 0, it follows that f’(x) < 0 for x < xo and 
f'(x) > 0 for x > xo (in particular, xo is the only critical point of f). Then, items 
(b) and (d) of Proposition 9.57 (this time applied to f) guarantee that f decreases in 
IM (—00, xo] and increases in JM |xo, +00). Therefore, xo is the only global minimum 
point of f. oO 


With the aid of the previous corollary, we can given an interesting application of 
the first variation to Euclidean Geometry. 


Example 9.61 We are given an angle ZAOB such that AOB < 90° (cf. Fig. 9.10), 


— — 
as well as a point P inside it. Show how to choose points X € OA and Y € OB such 
that X, P and Y are collinear and XY has the minimum possible length. 


Fig. 9.10 Line segment XY A 
of minimum length 
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Proof In what follows, we shall rely on [4] for the necessary background in 
Euclidean Geometry. ” 7 a 
Taking angles in radians, let AOP = a, BOP = 6 and OXY = 9, so that a 


and B are known and @ is variable (i.e., depends on the position of X along OA). 
Since the sum of the internal angles of triangle OXY equals m radians, we have 
0<@ <a —a—fandOYX =xn—-a—f-89. 

Applying the Sine Law (cf. Sect. 7.3 of [4]) to triangles XOP and YOP, we get 


PX sina ‘ PY sin B 
and —— = ————__,, 
OP sin 0 OP sin(a + 8 + 0) 


so that 


XY = PR+ PY = OP(—S + sin B ) 
~ 7 sind singa+B+0))- 


Hence, letting f : (0,7 — a — B) > R be given by 


sina sin B 


ae sind = sinfa+ B+ 6)’ 


it suffices to show that there exists a single 0 € (0, 7 — a — B) in which f attains its 
global minimum. 

The first derivative test assures that, if f has a global minimum point, then it must 
be a critical one. On the other hand, an easy computation furnishes 


sinacos@  sinBcos(a+ 6B + 0) 


£6) = sin? 0 sin?(a + B + 0) 


Substituting y = 2 — a — B, we easily get 
lim f’(0) =—oo and lim f’(@) = +o. 
G—>0 é>y 


Therefore, the IVT (applied to f’, which is clearly continuous on 6) guarantees the 
existence of 6 € (0, y) such that f’() = 0. Now, 


sina(1 + cos? 6) sin B(1 + cos?(a + B + @)) 


> 0, 
sin? @ sin'(a + B + 6) 


f"(0) = 


since a, 8, 8,a + B + 6 € (0,7). Hence, Corollary 9.60 says that 4 is the only 
global minimum point of f. oO 
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We finish this section by applying Proposition 9.57 to prove the following result, 
known as Darboux’s theorem.” 


Theorem 9.62 (Darboux) /fJ C Ris an interval andf : I — R is a differentiable 
function, then the function f' : I + R, even if discontinuous, satisfies the mean 
value property. 


Proof Let a < bin J and d be a real number between f(a) and f(b). We wish to 
establish the existence of c € [a, b] such that f(c) = d. To this end, we consider two 
cases separately: 


(i) f’(a) < 0 < f’(b) (or vice-versa): if f is not injective in [a, b], then there exist 
a < B in [a,b] such that f(a) = f(6). By Lagrange’s MVT, there exists c € 
(a, B) (hence, c € (a,b)) such that f’(c) = fees ta) — 9, If f is injective in 
a, b], it follows from Theorem 8.35 that f is monotonic in [a, b]. Therefore, the 
previous proposition gives f’ > 0 in [a,b] orf’ < 0 in [a, b], which is not the 
case. 

(ii) f’(a) < d < f’(b) (or vice-versa): let g : 1 > R be given by g(x) = f(x) — 
so that g is differentiable, with g’(x) = f’(x) — d for every x € I. Hence, 
g'(a) < 0 < g’(b) or vice-versa, and item (i) assures the existence of c € (a, b) 
such that g’(c) = 0. This is the same as f’(c) = d. Oo 


Problems: Section 9.6 


1. Prove Proposition 6.25 applying the results of this section. 
2. In each of the following items, find the maximum and minimum values (if these 
exist) of the given functions: 


(a) f : 0, +00) > R given by f(x) = 
(b) f : R > R given by f(x) = 


oe 
where a and b are given positive real 


numbers. re ma 

(c) f : (0, +00) > R given by f(x) = x? + i where a is a given positive real 
number. 

(d) f : (0, +00) > R given by f(x) = zy, where a is a given positive real 
number. 


3. Let f : R > R be given by f(x) = x? + ax* + bx +c, where a, b,c € R are real 
constants. Discuss the first variation of f in terms of a, b and c. 

4. * Show that x > sinx > x— x and cosx > 1 — _ for every x > 0. 

5. In a square ABCD of diagonals AC and BD, the sides have length 2. Mark the 


midpoint P of side AB and, then, a point Q € AD and a point R € CD such that 


° After Jean-Gaston Darboux, French mathematician of the XIX and XX centuries. 
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POR = 90°. Find the position of Q along side AD such that the area of triangle 
PQR is as large as possible. 

6. Let f : (a,b) — R be a differentiable function and P(xo, yo) be a point not 
belonging to the graph of f. If there exists A € Gy such that 


AP = min{ A’P; A’ € G7}, 


we say that AP is the distance from P to G;. In such a case, prove that AP Lr, 
where r denotes the tangent line to Gy at A. 

7. Letf : (a,b) > R and g: (c,d) > R be differentiable functions whose graphs 
do not intersect each other. If there exist points A(a,f(a@)) and B(6,f(B)) such 
that 


AB = min{ A’B’; A’ € G; and B’ € G,}, (9.26) 


we say that AB is the distance between the graphs of f and g. In such a case, 
Show that f’(@) = g’(B). 

Given differentiable functions f : (a,b) — R and g : (c,d) — R whose 
graphs do not intersect, the result of the previous problem assures that the 
problem of finding points A € Gy and B € G, satisfying (9.26) is equivalent 
to that of minimizing F : (a,b) x (c,d) > R given by 


F(x, y) = (x—y)? + F() -— g0))’, 


subject to the constraint f’ (x) = g’(y). Nevertheless, the pair of functions f, g : 
(0, +00) — R given by f(x) = + and g(x) = 0 shows that it is not always 
the case that such a pair of points Ae Gy and B € Gy actually exist. The next 
problem applies the above discussion to a case in the positive side of things. 

8. Compute the distance between the graphs of functions f, g : R — R such that 
f(x) = x e g(x) = 1 — (x — 3)’, admitting the (geometrically plausible) fact 
that such a distance is attained by a certain pair of points A € Gy and B € Gy. 

9. In Fig.9.11, we have BC = 2BH. Compute the largest possible value of the 
measure of angle BAC. 

10. ABCD is an isosceles trapezoid of bases AD and BC, with AD > BC, and legs 
AB and CD. If BC = aand AB = CD = b, compute the measure of the angles 
BAD = ADC for the area of ABCD to be as large as possible. 


Fig. 9.11 Maximizing the A 
measure of angle BAC 
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11. We are given an angular region ZAOB, suce that AOB a 90° (cf. Fig. 9.10), 


and a point P inside it. Choose points X € OA and Y € OB i in such a way that 
X, P are Y collinear. Show that PX - PY is minimum if and only if OXY is an 
isosceles triangle, with basis XY. 

12. Let a line r and points A and B be given, as shown in Fig. 9.12. The distances 
from A and B to r are respectively equal to a and J, and c is the length of the 
line segment joining the feet of the perpendiculars dropped from A and B to r. 
Find the position of a point P along r such that the sum AP + BP is as small as 
possible. 


13. In Fig.9.13, we have AA’, BB’ Lr, AA’ = a, BB’ = band A’B’ = 1. If 1 < 
/2(a2 + b?), show that there exists a single point P € r such that AP- BP 
attains its minimum possible value. Show also that P € A’B’. 

14. Show that (@E1)""! 

15. For each real number k > 1, compute the minimum possible value of x + y, 
where x and y are real numbers satisfying (x + V1 +2x2)Qyt+ J/1+y*) =k. 

16. (Romania) Compute the minimum possible value of x + y + for 
distinct and positive real numbers x and y. 

17. (BMO) In a triangle ABC, let sin” 4 cos*8 8 = sin? 8 =e s8 4 . Compute the 


ratio AC 


. (OIMU) Let f : [0,1] — [0, 1] be continuous in [0, 1], differentiable in (0, 1) 
and such that f(0) = 0 and f(1) = 1. Prove that there exist distinct a, b € [0, 1] 
such that f’ (a)f"(b) = 1. 

19. (Leningrad) Let f : R — R be a polynomial function such that 


> (2)", for every m,n EN. 


2 1 
ry Dy? 


1 


(oe) 


f@)-f£@O-f"O +f" = 0 


for every x € R. Prove that f(x) => 0 for every x € R. 
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9.7 The Second Variation of a Function 


Along this section we continue to study how the behavior of the derivatives of a 
function influence the shape of its graph. As before, in all that follows J C R denotes 
an interval. 


Definition 9.63 A function f : J — R is said to be convex if, for all a,b € I and 
t € [0, 1], we have 


F(U — da + th)) < A— f(a) + #(). 


As t varies from 0 to 1, basic Analytic Geometry guarantees that the points of the 
form (1 — f)(a,f(a)) + t(b,f(b)) trace out the line segment secant with endpoints 
(a, f(a)) and (b, f(b)). Hence, f : J > R is convex if and only if, for all a < bin 
I, the portion of the graph of f between lines x = a and x = b does not intersect 
the open half-plane situated above the straight line passing through (a,f(a)) and 
(b, f(b)). Yet in another way, letting 


R+(f) = {@y) € Rs x€ J and y > f(a}, 


f is convex if and only if R4+(f) is a convex region of the plane. 

The next definition refines the notion of convexity for functions (pay particular 
attention to the range of values of ft, when compared to those in the former 
definition). 


Definition 9.64 A function f : J — R is strictly convex if, for all distinct a,b € I 
and every t € (0, 1), we have 


F(A — ta + th)) < 1 —of(@ + tf (0). 


Analogously to the above, we say that f : J — R is (strictly) concave if —/ is 
(strictly) convex. Hence, f is concave if and only if 


f(U — ta + th)) => 1 —of(a) + f(b) 


for all a,b € I and t € [0, 1]. Accordingly, f : 1 — R is strictly concave if and only 
if f(1—ta+tb)) > A —nf(@ + f(b) for all distinct a, b € J and every t € (0, 1). 

Properties of (strictly) concave functions are easily derived from those of 
(strictly) convex ones, just changing f by —f when necessary. For this reason, in 
all that follows we shall restrict ourselves to the analysis of convex and strictly 
convex functions, leaving to the reader the task of adapting it to concave and strictly 
concave ones. 
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Definition 9.63 guarantees that every strictly convex function is convex. Never- 
theless, the converse is not true in general, as the coming example shows. 


Example 9.65 Every affine function is convex, albeit not strictly convex. Indeed, if 
f :R- Ris given by f(x) = Ax + B, with A, B € R, it is immediate to verify that 
f(A -—fa+ tb) = (1 — f(a) + f(b), for all a,b € R and every t € [0, 1]. 


In Corollary 9.70 we will show that, if J is an open interval, then a twice 
differentiable f : J — R is convex if and only if f’(x) > 0 for every x € I. We 
will also show that, if f’(x) > 0 for every x € J, then f is strictly convex. By 
assuming the truth of these results for the time being, we now list some examples 
of convex and concave functions (note that when we sketched the graphs of these 
functions we implicitly assumed that they were indeed convex or concave, according 
to the case at hand). 


Example 9.66 


(a) The restriction of the inverse proportionality function to the set of positive reals 
is strictly convex. Indeed, letting f(x) = + for x > 0, we have f” (x) = = > 0. 

(b) Letting n > 1 be an integer, the function f : (0, +co0o) > R given by f(x) = x" 
is also strictly convex, for, f” (x) = n(n — 1)x"-? > 0. 

(c) The sine function is strictly concave in the interval (0,7), since sin” x = 
—sinx < 0in (0,z). 

(d) Since tan” x = 2 tanxsecx > Oin (0, z), the tangent function is strictly convex 
in this interval. 


Remark 9.67 If f : I > R is convex, then, letting tf = 5 in Definition 9.63, we 
get f (4°) < Borne) for all a,b € I. Conversely, if f is continuous and such that 
f (44) < Aarne) for all a,b € T, it is possible to show (see Problem 8) that f 
is convex. By the same token, it is also possible to show (see Problem 9) that a 
continuous f : J > R is strictly convex if f(4*) < Ha tF@) for all distinct a, b € J. 
In turn, these remarks allow us to use the elementary inequalities we had studied to 
establish the (strictly) convex or concave character of several common functions. In 
this respect, take a look at Problem 11. 


Back to the development of the theory, let f : J > R be a convex function. Then, 
givena < bin/J and 0 <t < 1, we have 


f(A — ta + th) < Ul —of@ + (0). (9.27) 


Letting x = (1 — ta + th, we have t = —* andl —t= =e Therefore, the last 
inequality above can be written as 


b— = 
fQ) < = -fla) + — f(b). 
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f(0) 


(1—t) f(a) + tf() 


f(a) 
f((1—t)a + tb) 


(1—t)a+ tb 


Fig. 9.14 Graph of a convex function 


In turn, this gives 


f@)-f@ 1 (b=x xa f(b) —f(@) 
= f@+ ‘f() —f@ }) = —— 
x—a x-—a\b-a b-a b-a 
and, analogously, LO) fe) > a. In short, letting f be convex in J, we have 


fM-f@ _fO)-f@ _ fO-fO) 


9.28 
x-a = b-a ~ b-x ( ) 


for alla <x <binI. 
Conversely, if 


f@)-f@ _ £0) =f) 


x-—a a b-x 


, (9.29) 


for alla <x < bin/, then (b —x)(f(®) —f(@) < (x— a) (f(b) —f()) or, which is 
the same, 


(b—a)f(x) < (b—xf@ + @— af (0). 


Substituting x = (1 — Aa + tb in this inequality, we immediately recover (9.27), so 
that f is convex. 

Notice that the quotients in (9.28) are the slopes of the secants to the graph of f 
and passing through the pairs of points (a, f(a)) and (x, f(x)), (a, f(a)) and (b, f(b)), 
(x, f(x)) and (b, f(b)), respectively. On the other hand, such slopes are the tangents 
of the trigonometric angles measured from the horizontal axis to those secants. Since 
the tangent function increases in each of the intervals (0, ) and (3, zr), the reader 
can easily use the comments above to grasp the “turning behavior” of those secants, 
as we walk along the horizontal axis in the positive direction (see Fig. 9.14). 
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In what follows, we use the previous discussion to show that a convex function, 
defined in an open interval, is continuous. 


Proposition 9.68 [f I is an open interval and f : I — R is convex, then f is 
continuous. 


Proof Fix x9 € Tanda < xo < b also in J (this is possible due to the openness 
of /). For x € (x0,b), the first inequality in (9.28), with xo in place of a, gives 


font) < poeo) in turn, the second inequality in (9.28), with x9 in place of x 


and x in place of b, gives E=sto) Go) 5 [20 By combining these two inequalities, 
P 0 Xo—a 
we get 


(SPO) (oy = 700 — Fe) < (LOD) oo) 


Xo —a 


Letting x — xo+, it follows from these inequalities, together with the squeezing 
theorem (cf. Proposition 9.8) that 


lim, #0) = FG). 


Arguing in an analogous way, we conclude that lim,—+,,— f(x) = f(xo). Therefore, 
Problem 5, page 290, guarantees that lim,-,,, f(x) = f(xo), and Proposition 9.4 
shows that f is continuous at xo. Oo 


As we have previously commented, we shall now show in a moment that the 
(strictly) convex character of a twice differentiable function is intimately related to 
the sign of its second derivative. This, in turn, will be a straightforward consequence 
of the coming result, in which we assume that the function under consideration is 
merely differentiable. 


Theorem 9.69 /f 1 is an open interval andf : I — R is differentiable, then: 


(a) f is convex in I if and only if f’ is nondecreasing in I. 
(b) f is strictly convex in I if and only if f' is increasing in I. 


Proof Firstly, let f be convex in J. Given a < bin J and x € (a,b), it follows 
from (9.28) that 


fQ)-f@ _ f)-f@ _ fO)=fe) 


x-a b-a -_ b-x 
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However, since f is differentiable in 7, letting x — a+ in the first inequality and 
x —> b— in the second one, we obtain 


b) —fla 
f(a) OL <p@, 


Hence, f’ is nondecreasing in /. 
If f is strictly convex, we refine the above reasoning bay taking c € (a,b) and 
x € (a,c), y € (c,d). Then, (9.28) and the strict convexity of f give 


fO—-f@ _fO-I/@ _fO-fO _fO-fY 


x-a c—a b-—c - p=y y 
Now, letting x — a+ and y — b-, we get 


fO-f@ _ fO)-fO 
Cc 


c-a b- 


f@s <f'(b), 
so that f is increasing in J. 

Conversely, suppose that f’ is nondecreasing in J. To show that f is convex, we 
previously saw that it suffices to prove that 


fx) =f@) _ fb) = f@) 


x-a > b-x 


for alla < x < binTJ. Since J is open and f is differentiable in J, we have f 
continuous in [a,b] and differentiable in (a,b). Hence, Lagrange’s MVT assures 
the existence of a € (a,x) and 6 € (x,b) such that Annie = f’(a) and 
(Ot) me — & — f'(B). However, since a < f and f’ is assumed a be nondecreasing, 
we nave ya) < < f’(B). Thus, 


b 
fa) -f@ = = f(a) <f'(6) = a 


x= 


as we wished to show. 
In case f’ is increasing in J, the above reasoning gives 


fa) —f@ KO) FO)-FG) 
b-—x 


x— 


= f(a) <f'(B) = 


Hence, we get a strict inequality in (9.29) for alla < x < bin TJ, and f is strictly 
convex. oO 


Corollary 9.70 Let I be an open interval and f : I > R be twice differentiable. 


(a) f is convex in I if and only iff" > 0 in 1. 
(b) If f"” > 0 inI, thenf is strictly convex in I. 
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Proof Item (a) follows immediately from item (a) of the previous result, together 
with item (a) of Proposition 9.57 (applied to f’). 

In what concerns (b), if f” > 0 in J, then item (b) of Proposition 9.57 (applied 
once more to f’) guarantees that f’ increases in J. Therefore, item (b) of the previous 
theorem assures that f is strictly convex. Oo 


Let us see a simple application of this result. 


Example 9.71 Let f : R \ {-V2, /2} — R be given by f(x) = +. Find the 


2—x2° 
intervals of the real line in which f is strictly convex or strictly concave. 


Solution Computing the first and second derivatives of f, we get f’(x) = ae 


x 
and f” (x) => CDG: Hence, 


: x>Oand2-—x?>0 0<x< V2 
f"Q) > 0 5 520 or > or 
ss x<Oand2—x <0 eer 


We readily conclude that f is strictly convex in each of the intervals (—oo, —/2) 
and (0, /2), and strictly concave in each of the intervals (— /2, 0) and (/2, +00). 
oO 


The former corollary is usually referred to as the study of the second variation 
of a twice differentiable function. We now wish to look at the points in which the 
graph of f changes from being convex to being concave, or vice-versa. Before that, 
it is useful to have the following definition at our disposal. 


Definition 9.72 Let J be an open interval and f : J > R be a continuous function. 
A point xo € J is said to be an inflection point of f if there exists 6 > 0 such that f 
is convex in] M (xo — 6, x) and concave in JM (xo, Xo + 4), or vice-versa. 


The coming corollary gives a necessary condition to be satisfied by the inflection 
points of twice differentiable functions. 


Corollary 9.73 Let I be an open interval and f : I > R be a twice differentiable 
function. If xo € I is an inflection point of f, then f" (xo) = 0. 


Proof Suppose f is convex in (xo — 6, x9) and concave in (xo,x9 + 6), with 6 > 0 so 
small that (x) —6, x) +6) C J (the other case can be dealt with in an analogous way). 
Then, Corollary 9.70 gives f” (x) => 0 in (xo — 5, xo) and f” (x) < 0 in (xo, x9 + 4). 
Suppose f” (xo) < 0, and fix an arbitrary a € (xo — 5, x9), so that f” (xo) < 
0 < f’ (a). Since f” (x0) < a < f" (a), Darboux’s Theorem 9.62 assures the 


existence of b € (a, xo) such that f”(b) = P's) < 0. However, since b € (a, xo) > 

b € (xp — 6, x0), we should also have f” (b) => 0, which is a contradiction. 
Analogously, we also reach a contradiction by supposing that f” (xo) > 0. Hence, 

the only possibility is that f” (x9) = 0. Oo 
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We now state and prove Jensen’s inequality,!° a result that gives greater 
flexibility to the applications of convex functions to problems of maxima and 
minima. 

Theorem 9.74 (Jensen) LetI C R be an open interval and f : I > R be a convex 
function. If x,,X,...,X, € Land ty, to,...,t, € (0,1), witht} +t +---+4, = 1, 
then t1xX, + toX2 +--+ + tyX, € Tand 


Sf (tia + tox2 +++ + ttn) < tif 01) + tof (2) + +++ + taf Qn). (9.30) 


Moreover, if f is strictly convex, then equality happens if and only if x} = x2 = 
= Xp. 


Proof Suppose f is strictly convex (the case of a merely convex f is totally 
analogous) and let’s prove Jensen’s inequality by induction onn > 1. Case n = 2 
follows from the definition of strict convexity for f, since condition t; + t2 = 1 is 
equivalent to 4, = tandt = 1-—t. 

Suppose that, for a certain n > | and all x1,%2,...,%, € Jandt,h,...,th € 
(0, 1), with t) + tf +---+t, = 1, we have tx; + tyx2 +--+ tx, € J and 


F(tim1 + fox. + +++ + tnxn) < tf (1) + tof 2) + +++ + ef On), 


with equality if and only if x1 = x. = +--+ = x,. Let’s consider x1, x2, ..., Xn, 
Xn41 € Tand ty, fo,...,tn, toi € (0, 1) such that 4) +f +---+t+t%41 = 1. 
Define 


OX + 1X2 bt + tnXn 


= SX + S2X2 +++ + SpXn, 
1 thy1 


. i ‘ 
with s; = Tg) for l <j <n. Then, s1,52,...,5, > 0 and 
n 


T 
=> ——__ a —t +1) = 1, 
1 —ty41 =I l—t4i " 
so that s; € (0, 1) for 1 < j < n. Hence, induction hypothesis gives y € / and, thus, 
tx, + x2 bee + tyXn + Int itn = (l= tag iy + freeing € L. 


Now, by invoking the strict convexity of f we obtain 


A (tix + tox2 Fe + tpg tng) = f( = the dy + tri X41) 
a l == tot fy) ++ tof On+1). 


10 After Johan Jensen, Danish engineer and mathematician of the XIX and XX centuries. 
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with equality if and only if y = x,4,. On the other hand, also by induction 
hypothesis we have 


f(y) =f (s1x1 + S2x2 + +++ + SpXn) < Sif) + Sof (X2) + +++ + Sf Xn) 


1 
= if Or) + taf 2) +2 + taf Bn), 
~ 'n+1 
with equality if and only if x) = x2 = +++ = Xp. 


Taking the two inequalities above together, we conclude that 


A (tix + tex. ++ + tht iXngi) S = td fO) + tro if Qn+1) 
= (if) Sa taf (%n)) + tnoif Xn+1); 


with equality happening if and only if y = x,4; and x; = x. = +++ = X,. Finally, 
it’s immediate to verify that these conditions are equivalent to x) = x. = ++: = 
Xn = Xn+1- oO 


Most often, we shall apply Jensen’s inequality in the form of the coming 
corollary, which is stated for the reader’s convenience. 


Corollary 9.75 (Jensen) Jf C R is an open interval and f : I > R is a convex 
function, then, for x,,X2,...,X, € I, we have 


(9.31) 


n n 


p(s) ee ee 


Moreover, if f is strictly convex, then equality happens if and only if x} = x2 = 


soo X. 


Proof Lett) =h=-::-=th,= 4 in the previous result. oO 


We leave to the reader the task of stating Jensen’s inequality in the case of a 
(strictly) concave function f. Presently, we concentrate ourselves in discussing some 
applications of it. 


Example 9.76 Given n > | positive reals a), az, ..., dn, prove that 


£1 1 5 
(Qi Ga t+ da) | ee, 
a\ a2 


an 


with equality if and only if a; = az = ++: =a). 


Proof Example 9.66 showed that the inverse proportionality function f 
(0, +co) > R, given by f(x) = 1, is strictly convex. Hence, it follows from (9.31) 
that 
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n =; (4402 *) 
a) + a2 +++ + An 


n 
—_ fa) + f(a) + + + FG) 
~ n 
= 1/ay + 1/az +-+++ 1/an 
7 n 
with equality if and only if a; = a. = --- = a,. Now, it suffices to observe that 
inequality 
n e Lait l/ar+-+++1/an 
aq, t+d.+---+a, n 
is precisely what we wished to get. Oo 
Example 9.77 (BMO) Letn > 1 and a,...,d, be positive reals with sum equal to 


1. For each 1 <i <n, letb) = a; + ---+ qG—) + Gi4) +--+ + Gy. Prove that 


a, ru a2 + + an 3 n 
1+h l+bh 1+ by ~ 2n-1? 
with equality if and only if a,j = a7 =---=a, = 4. 


n 
Proof Substituting b; = 1 — a; for 1 < i < n, it suffices to prove that 
ay a2 an n 


be. > ; 
Tn ae 5a On 


To this end, we claim first that f : (—oo,2) > R, given by f(x) = 4, is strictly 
convex. Indeed, one readily computes f” (x) = = > 0. 


Hence, by applying Jensen’s inequality we get 


n 1 re 1 5 


with equality if and only if aj = dy) = +++ = ay, ie., if and only if a; = az =+-- = 


Gn = 7: oO 


Our next example applies Jensen’s inequality to solve an interesting geometric 
problem. 


Example 9.78 Let T bea semicircle of radius R and diameter AoA. For each integer 
n > 2, show that there exists a single convex n-gon AgA Az... A,—; Satisfying the 
following conditions: 


(a) Ad, siete Ay ef. 
(b) The area of ApA,A2...A,—1 is as large as possible. 
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Fig. 9.15 A convex n-gon A3 
inscribed in a semicircle 


Ao O A, 


Proof Figure 9.15 depicts a convex n-gon AgA;A2...A,—1 inscribed in I’. Let 
A,OAi+1 = a; for 1 < 1 < n—1, where Os the center of I’ (and with the convention 
that A, = Ao). Hence, a + @2 +++++@,—-1 = a. Applying the sine formula for the 
area of a triangle (cf. Sect. 7.3 of [4]), we get 


n—-1 n—-1 
1 a 
A(AoA1...An-1) = )_A(AiOAi41) = D> 5 sin Aj;OA:+1 


i=1 i=1 


Now, since the sine function is strictly concave in the interval [0, zr], it follows 
from (9.31) that 


n—-1 n—1 
1 a 
Y- sina; < (n—1)si y\a;) = (2—1)si 
sina; < (n sin ( ye) (n — 1) sin i 


; n—1 n— 
i=1 


with equality if and only if a) = --- = a,—| = =4,. Therefore, there is indeed only 
one convex n-gon satisfying the prescribed conditions. oO 


Problems: Section 9.7 


1. Let J,/ C R be intervals, and g : ] > J and f : J > R be two strictly convex 
functions. If f is increasing, prove that f o g : J > R is also strictly convex. 

2. Let I,J C R be intervals and f : J > J be a continuous bijection. If f is strictly 
convex, prove that f—! : J + J is either strictly convex or strictly concave in J. 

3. * Prove that the sum of a finite number of strictly convex (resp. strictly concave) 
functions on a common domain is also a strictly convex (resp. strictly concave). 


4. Let f : (—oo, 1) > R be the function given by f(x) = Ti Find the intervals 


in which f is convex (resp. concave). 
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3: 


10. 


11. 


12. 


13. 


14. 


15. 


16. 
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Let a,b,c € Randf : R > R be given by f(x) = x° + ax? + bx + c. Find 
the inflection points of f, as well as the intervals along which it is convex or 
concave. 


. Let f : (0, +00) > R be given by f(x) = x’ sin 1. Show that f has infinitely 


many inflection points. 


. Let f : (0, +00) — R be a twice differentiable, nondecreasing and convex 


function. If g : (0, +00) — R is given by g(x) = xf(a), show that g is also 
convex. 


. Let f : J — R be a continuous function such that f (Be) < Aoi) for all 


xyel (resp. f(=) > foro) for all x, y € I). Prove that f is convex (resp. 
concave). 


. Let f : J > R be continuous and such that f(4) < fore) (resp. f(4") > 


Bane), for all distinct a, b € I. Prove that f is strictly convex (resp. concave). 
Let f : (0, +00) — R be a continuous, increasing (resp. nondecreasing) and 
convex (resp. strictly convex) function. Prove that g : (0, +coo) > R, given by 
g(x) = xf (x), is also strictly convex. 

Use the results of Problems 8 and 9 to establish the strictly concave or convex 
character of each of the functions of Example 9.66. 

Let / be an open interval and f : J —> R be a twice differentiable convex 
function. Show that, for every xo € J, the graph of f is not below its tangent line 
at x9. More precisely, show that, for all x, xo € J, we have 


f(x) = f (xo) +f’ (x0) — x0). 


(Romania) Let J C R be an interval and f : J — R be strictly convex 
and increasing. Prove that the sequence (f(7)),>1 does not contain an infinite 
arithmetic progression. 

Let k > 1 be an integer and aj, d2,..., a, be positive reals. Prove that 


n n 


ee ee (2 tars 
with equality if and only if aj = az = +++ = ay. 
Let T be the circle of center O and radius R, n > 2 be a given integer and 
A,A2...A, be a convex n-gon inscribed in T’. 


(a) Suppose that O does not belong to the interior of AjA2...A,. Use a 
geometric argument to show that there are convex n-gons inscribed in I" 
and with area strictly greater than that of AjA2...Apy. 

(b) Use Jensen’s inequality to show that, among all convex n-gons inscribed in 
T’, the regular ones are those of maximal area. 


Let be given a circle I’, with center O and radius r, and an integer n > 2. Among 
all convex n-gons A,A2...A, circumscribed to I’, prove that the regular ones 
are those of minimum perimeter. 
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17. 


18. 


19. 


Let ABC be an acute triangle of sides AB = c, AC = band BC = a. If Ris its 
circumradius (i.e., the radius of the circle circumscribed to ABC), show that 


at+tb+c < 3RV3, 


with equality if and only if ABC is equilateral. 
(Romania—adapted) Let ABC be an equilateral triangle of height h, let P be an 
interior point and x, y and z be the distances from P to the sides of ABC. 


(a) Prove thatx+y+z=h. 
h-y 


7 7 h—-x 
(b) Find the least possible value of ag a +4 i oo 


(Turkey) Let n > 1 be an integer and x1, x2,...,X, be positive reals with sum 
equal to 1. Prove that 


oe NA + Vin 
T= 1 J/n—1 


with equality in any of the above inequalities if and only if x; =--- =x, = 4. 


9.8 Sketching Graphs 


Let be given an interval J and a continuous function f : J — R, which is twice 
differentiable in the interior of J. In principle, the theory developed so far allows us 


to 


draw a reasonably accurate sketch of the graph of f. Indeed: 


(i) The results collected in Problem 10, page 194, allows us to possibly reduce the 
task of sketching the graph of f to doing so in an appropriate interval J C I. 


(ii) The first derivative test guarantees that the extreme points of f are either the 


endpoints of J or the solutions of equation f’(x) = 0. 


(iii) The study of the first variation of f assures that the intervals along which f 


increases (resp. decreases) are the solution sets of the inequality f’(x) > 0 
(resp. f’(x) < 0). 


(iv) The inflection points of f are found among the solutions of equation 


f' (x) = 0. 


(v) The study of the second variation of f guarantees that the intervals along 


which f is strictly convex (resp. strictly concave) are the solution sets of the 
inequality f” (x) > 0 (resp. f” (x) < 0). 


(vi) The tangent lines to the graph of f in its inflection points help us in sketching 


(vii 


the graph in a neighborhood of each of those points. 
ii) Definition 9.18 and Problem 11, page 291, teach us how to find (if any) the 
asymptotes to the graph of f. 
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(viii) 


(ix) 


9 Limits and Derivatives 


The computation of the limits lim,_,+,. f(x) (in case they make sense) allows 
us to better understand the behavior of the graph of f for large values of |x|. 
Plotting some other points along the graph can be quite helpful in sketching it. 


In this section, we illustrate the application of the program just delineated to 
sketching the graphs of some simple functions. 


Example 9.79 Let D = R\{4-+ks; k € Z}. Sketch the graph of the secant function 
sec:D—>R. 


Solution 


(i) 


(ii) 


(iv) 


(vii) 


(viii) 


(ix) 


First of all, note that it suffices to sketch the graph of sec in (—4, 5) U(F, = : 
for, since the cosine function is periodic of period 27, so is the secant function. 
On the other hand, since cos(x + 7) = —cosx for every x € R, we have 
sec(x + 7) = —secx for every x € D; hence, Problem 10, page 194, 
guarantees that the portion of the graph of sec in the interval (4, *t) is 
obtained from that in the interval (-+, 4) by means of a reflection along 
the horizontal axis, followed by a translation of z units, parallel to that same 
axis. 

In view of the above, from now on we restrict the analysis of the graph of the 
secant function to the interval (—4, 5). 

and (iii) Since 

sin x 


sec’ x = (cos! x)’ = —(cosx)~?(— sinx) = — 
cos? x 


= tan x sec x, (9.32) 
we have sec’x = 0 © sinx = 0 © x = O. (Indeed, 0 < cosx < 15 
secx > 1, with equality if and only if cosx = 1, ie., if and only if x = 0. 
Hence, x = 0 is an extreme point—thus, a critical one—of sec.) On the other 
hand, sec increases & sec’'x > 0 sinx >O0OS xe (0, z); therefore, sec 
decreases in (—%, 0). 

and (v) Firstly, Example 9.34 and (9.32) furnish 


sec” x = tan’ xsecx + tanxsec’ x = sec? x + tan* xsecx 


= sec x(sec” x + tan? x). 


Therefore, sec” 4 0, and sec has no inflection points. Also, since sec is strictly 
convex if and only if sec” x > 0, which in turn happens if and only if cosx > 
0, we conclude that sec is strictly convex in (=F; : 

Since sec is periodic, it graph does not contain horizontal or oblique asymp- 
totes. On the other hand, we easily get lim,_.x_ secx = lim,,_*4 secx = 
+00, so that the vertical lines x = + are asymptotes of the graph of sec. 
Since sec is periodic and nonconstant, there does not exist lim, +0 f(x). 


Computing cosx for x = 0,45, £%, +74, +4 and tH, we plot points 


(0, 1), (+2, +); (+4, v2) and (+4, 2) on the graph of sec. 
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I 


Fig. 9.16 Graph of sec in (—4, 4) U (4, *) 


Gathering together the above information, we sketch the graph of sec in 
the interval (—4, 5) U (4, *t) as shown in Fig. 9.16. 
Oo 


Example 9.80 Sketch the graph of the polynomial function f : R > R, given by 
f(x) = 2x? + 497 + 2x-1. 


Solution 
(ii) and (iii) Since f’(x) = 6x” + 8x +4 2, the critical points of f are x = —1 and 
> il 1 
x = — 3. Thus, f’(x) < 0 @ x © (—1,—3), and f decreases > f"(x) < 
OSxeE (—1, -4). Therefore, f increases x € (—oo,—1) U (—4, +00). 
It follows that x = —1 (resp. x = —4) is a local maximum (resp. minimum) 
point of f. 


(iv) and (v) Since f” (x) = 12x + 8 and f is strictly convex (resp. concave) in an 
interval J if and only if f’(x) > 0 (resp. f(x) < 0) in J, we conclude that 
f is strictly convex (resp. concave) in (-4, +00) (resp. (—oo, —4)). Finally, 
x= —4 is the only inflection point of the graph of f. 

(vi) Notice that f (—4) = —#, and the tangent line to the graph of f in x = —4 
has slope f’ —4) a —3, Letting (0, c) be the point where such a tangent line 


c—(—31/27) __ 


: : : y) — _ 4 
intersects the vertical axis, we have Tey = ~ 37 80 that c = ; 
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Fig. 9.17 Graph of the 
polynomial function 
f@) = 2 +47 + 20-1 
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(vii) Since f is defined in the whole real line, its graph does not possess vertical 


asymptotes. On the other hand, since lim,-.+o0 ie 


= lim,-+ 


o0(2x7 + 4x + 


- +) = +00, Problem 11, page 291, guarantees that the graph of f does not 


have oblique asymptotes. 


(viii) It follows from Example 9.17 that lim,_,+60 f(x) = 


+oo. 


(ix) Since f(0) = —1, it follows from (viii) and the IVT that f has a positive root 


a. Companne ys) forx = —3, -1,-4,0 


1 a | 
> 4392 and 1, 


we plot points 


(= 3? —1), {= i =1), ‘= +,-33), (0, =1), (;.-). (3, H)s (5. >) and (1,7) 


on me graph of f. (Recall that, in item (vi), we had already get point (—2 =,- 


on the graph.) In particular, we conclude that a € (+ 7 + 
Finally, as in the previous example, we gather the above information to sketch 


the graph of f in Fig. 9.17. 


=) 


oO 


Example 9.81 Sketch the graph of f : R \ {-1} > R, given by f(x) = x7 + =: 


Solution 


Qx3-+-4x2- 424-1 
(ii) and (iii) Since f’(x) = 2x — ar = eet 


, we have f’(x) = 0 © 


2x3 + 4x? + 2x—1 = 0. Hence, the only critical point of f is the only positive 
root a of the polynomial function of the previous example, so that a € (;. + 
Then, f’(x) > 0 @ 2x3 + 4x7 + 2x-1>0 <x >a, and we conclude that 
f increases in (a, +00). Note a a@ is aminimum local point of f. 


(iv) and (v) Since f’(x) = 2+ mH 


Gp We have f a convex (resp. concave) 


in an interval / if and only if => > —1 (resp. sy < —1) in. Therefore, f is 
strictly convex in each of the anteevale (—oo, 5 and (—1, +00), and strictly 
concave in (—2, —1). Moreover, its only inflection point is x = —2. 
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(vi) Note that f(—2) = 3, and the tangent line to the graph of f at x = —2 has 


slope f’(—2) = —5. Letting (0, c) De the point in which such a tangent line 


intersects the vertical axis, we have = mea = —5, so that c = —7. 


(vii) Since x = —1 is the only point for which f(x) is not defined, the vertical 


line x = —1 is the only candidate for vertical asymptote of the graph. This 
is indeed the case, for, lim,,_;_ f(x) = —oo and lim,,-14 f(x) = +00. 


On the other hand, since lim,.+9 ae = limys+oo (x + <n) = +o, 


Problem 11, page 291, assures that the graph has no oblique asymptotes. 


(viii) It follows from Example 9.17 mat Timy-+-£00 f@wm= vo: 
1 


(ix) Computing f(x) for x = -3, —3,- a oe -t ,0, + ‘ 3, 1 and 2, we a 
points (—}, 5), (—3.- as (3-3), 3.5, asa asa OS 


(4. —<). (. 3), d, 3) and (2, 43) on the graph of f. (Recall that, in item A 
we had already get point (—2, 3) on the graph.) 

Once more, we collect all of the above to sketch the graph of f, as shown in 
Fig. 9.18. Note that, for the purpose of a better qualitative picture, we adopted 
different scales on the horizontal and vertical axes. 


oO 


Fig. 9.18 Graph of 


f@) = 
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Problems: Section 9.8 


1. Sketch the graph of the cosecant function, csc : R \ {ka; k € Z} > R, such that 
cscx = 5. 

2. In each of the items below, sketch the graph of the given function (defined in 
its maximal domain). For each such function (and whenever pertinent), compute 
explicitly or estimate the critical points, monotonicity intervals, inflection points, 
tangent lines at inflection points, intervals of convexity or concavity, asymptotes 
and behavior at infinity: 


(a) f(x) = x4 + 22 -—2n41. (e) f(%) = t+ sh. 
(b) f@) =x4+4. (f) f(x) = wy 

(c) f(@®) = sh. (g) f@) = xr 

(@) f@) = sh. () f@) = Ge 


3. * Let f : (a,b) — R be twice differentiable at x9 € (a,b), with f” (xo) # 0. 
Since f is in particular differentiable at xo, the tangent line 7 to the graph at 
A(xo,f(x0)), is not vertical. We define the osculating circle to the graph at A as 
the circle [’ tangent to r at A that better approximates the shape of the graph in 
a neighborhood of xo, in the following sense: taking c € (a, xo) and d € (xo, b) 
such that one of the arcs of I situated in the strip defined by the vertical lines 
x = candx = dis the graph of a function g : (c,d) > R, we have g(xo) = f (xo), 

g’ (xo) = f’ (xo) and g” (xo) = f” (xo). In this respect, do the following items: 


(a) Let O(a, B) be the ponte of I. If f’(xo) = 0, show that a = xo; if f’(x0) 4 0, 
show that 2/@0 — 


a—X0 “7 ay" 


(b) Show that the equation of T is (x — a)? + (y — B)? = R’, where R = 


vy (xX — a)? + (f (x0) — B)? is its radius. 


(c) For x € (c,d), substitute y = g(x) in (b) and differentiate the relation thus 
obtained to get 9’(xp) = POX = f’ (xo) and 9” (x9) = 1+ f" (xo)? | 


B—g(x0) ‘ B-f (xo) f 
+f’ +f’ 
(d) Conclude that B = f(x) + tf at and a = x9 —f’ (xo) ( + eu i; From 


this, and with respect to the line r, show that O is located in the upper (resp. 
lower) half-plane, provided f is strictly convex (resp. strictly concave) in a 
neighborhood of xo. 

23/2 


(e) Show that R = oe (In particular, observe that the osculating circle 
at A is uniquely determined by the conditions given at the statement of the 
problem.) 

(f) Show that the geometric vector 7 = (1,f’ (xo)), of origin A, is parallel to the 


line r. Show also that, upon applying to Tf a counterclockwise rotation with 


center A and angle 90°, we get a vector of the form A AO, with A € R*. 


The radius R of T° is the curvature radius of the graph of f at x (or A); the 
curvature of the graph at xo is the real number 
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Fig. 9.19 Osculating circle 
to the graph of x F> + atA 


4. 


_o of" (x0) 
~~ R = (1 + f!(x9)?)3/2’ 


where o = nt € {+1} is the sign of A. (Observe that the bigger R, the closer |k| 
is to 0. This fact reflects gives geometric intuition to the term curvature.) 


(g) LetR > Oandf : (—R,R) > R be given by f(x) = VR? — x? (so that the 
graph of f is a semicircle of radius R). Show that the curvature radius of the 
graph of f is constant and equal to R. 


As an illustration, Fig. 9.19 brings the osculating circle to the graph of f(x) = = 
x > 0, at a point A. 
The purpose of this problem is to sketch the graph of f : R > R, given by 


0, aaa 
xsin + -, sex #0- 


f@) = 


To this end, do the following items: 


(a) Show that it suffices to consider the case x > 0. 

(b) Show that the set of critical points of f in (0, +00) is the set of reals x such 
that x = 1, where y > 0 satisfies the equation sin y — ycosy = 0. 

(c) Show that the solutions y > Oto this equation form a sequence (y,,)n>1, such 
that na < yy, <n + 5 and the sequence (nz + 5 — Yn)n>1 decreases to 0 
when n — +00. 


(d) Ifx, = = for n > 1, show that x2,_; is a local minimum point and x2, is a 
local maximum point for f, for every k > 1. 
(e) Show that |f(1)| > [f(x2)| > [f(x3)| > --- > 0. 


(f) f increases in [x;, +00), with lim, + +0 f(0) —se 
(g) Mostre que f is strictly convex in (sb. Ope) and strictly concave in 


(aeenz: Tez)» for every integer k > 1. 


(h) Sketch the graph of f. 


Chapter 10 
Riemann’s Integral 


This chapter completes the task of establishing the fundamentals of Calculus, this 
time studying the operation of integration on functions. As we shall see in the next 
section, in its most simple form this reduces to the computation of areas under 
the graphs of nonnegative continuous functions f : [a,b] > R, suggesting that 
geometric intuition will play a strong role throughout. Among other byproducts of 
the coming discussions, we prove the irrationality of 2 and study the properties 
of two of the most ubiquitous functions of Mathematics, the exponential and 
logarithmic functions with base e. 


10.1 Some Heuristics II 


Our story begins in the III century B.C., with the great Archimedes, who considered 
(and solved) the problem of computing the area under an arc of parabola, by using 
the method of exhaustion. 

In modern notation and in a slightly more general situation, the heuristics behind 
such a method is the following: given a nonnegative function f : [a,b] > R (cf. 
Fig. 10.1), one wants to compute the area of the region FR of the cartesian plane, 
situated under the graph of f and above the horizontal axis, so that 


R = {(x,y) € R?;a<x<band0 <y<f(x)}. 


To this end, one starts by dividing the interval [a, b] into k equal intervals, with 
the aid of the points a = tf) < t) < +++ < % = b such that 4;-t-) = va for 
1 <i < k. Then (cf. Fig. 10.2), one considers the portion of ® contained in the 
vertical strip bounded by the straight lines x = f;_; and x = t;, and approximates its 
area, from below, by that of the greatest rectangle contained therein and having the 


[t;-1, t;] as one of its sides; accordingly, one approximates the area of that portion of 
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Fig. 10.1 The region R 
under the graph of f f 


Fig. 10.2. Approximating the 
area of R from above and 
from below 


R, from above, by that of the smallest rectangle containing it and having the interval 
[t;-1, ti] as one of its sides. 

One now approximates the whole area of ® from below and from above, 
computing, in each case, the sums of the areas of the k rectangles of the previous 
paragraph. Assuming that f attains its minimum and maximum values in each 
interval [t;-1, t;] (which is always the case if f is continuous), and letting m; and 
M; respectively denote such values (so that m; and M; are the lengths of the heights 
of the rectangles one is considering), one obtains the sums 


k k 
A(f:k) = Yo mij — t-1) = (=) So mi (10.1) 
i=1 


i=1 


and 


k k 
A(f; k) = AG —t-i)= (=) yo Mi. (10.2) 
i=l 


i=1 


Letting A(R) denote the area of R, one then gets 
A(f;k) < A(R) < A(f: 4), 


and hopes that A(f;k) and A(f;k) do approximate A(R) better and better as k > 
+00. 

In order to illustrate the difficulties involved, let’s consider the more particular 
situation of the function f : [0,b] — R given by f(x) = x", where n is a given 
natural number. As before, letting k € N and 0 = f < t <--- < &% = bbe 
the partition of [0, b] such that t; — 4) = e for 1 < i < k, we havet; = ot for 
0 <i<k. Also, since x +> x” is increasing, it follows that m; = f(t;-1) = _, and 


10.1 Some Heuristics II 


M; = f(t;) = t". Thus, 


k 
A(f:k) = ae ae 
=] 


and, analogously, 


b n+1 k 
A(f;k) = (2) YoG-) 
i=1 


It easily follows from these computations that 


n+1 


A(f;k) — A(f:k) = 


349 


(10.3) 


(10.4) 


(10.5) 


hence, A(f;k) — A(f:k) — 0 as k — +00. Therefore, at least in this case, we 
conclude that A(f; k) and A(f; k) approximate A(R) better and better, as the number 


k of rectangles increases. 


To compute the actual value of A(7), we use the result of Problem 21, page 86, 


which guarantees that 


= _ a 


k 
de-)" < 
i=1 


Combining these inequalities with (10.3) and (10.4), we get 


< yr. 


i=1 


pit 1V't! = 
A(f;k) < 1l-- < A(f;k). 
aps <2 (1-2)" agen 


3 n+ 
However, since both A(R) and =— 
nL 


[A(f: k), A(f; 4) ], relation (10.5) gives 


n n+1 
lace) —- (1 : *) | <A: &) —AUf:k) <2 


n+1 k 


Making k — +o%, we then get 


pet 1 n+1 pet 
A(R) = lim (: e *) ocala 
es) 


(10.6) 


(1 - ae belong to the closed interval 
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Fig. 10.3. Approximating the 
area under the graph of f 


Let’s now return (again in a modern setting) to the general problem of the 
computation of the area of the region under the graph of a given nonnegative 
function f : [a, b] > R (cf. Fig. 10.3). 

For each real number x € [a, b], let’s denote by A(x) the area of the region of 
the plane situated under the graph of f, above the horizontal axis and between the 
vertical lines of abscissas a and x (with the convention that A(a) = 0). For a fixed 
Xo € [a, b) and taking x9 < x < b, we have 


A(x) — A(xo) = A([xo, x]), (10.7) 


where A([xo, x]) denotes the area of the portion of the region just described, situated 
between the vertical lines of abscissas xo and x. 

For small values of x — x9 (when compared to the values of f along the interval 
[xo,x]), it is reasonable to assume that a good approximation for the value of 
A([xo, x]) is the area of the trapezoid with base lengths f(xo) and f(x) and having 
the line segment [xo, x] as one of its legs (the gray trapezoid of Fig. 10.3). Since the 
altitude of such a trapezoid equals x — xo, it follows from (10.7) that 


MAGS = Abas (er) Gai 


for small values of x — xo. This way, we conclude that 


A(x) ~AGo) _ Fo) +/@) 


x— Xo 2 


(10.8) 


for small values of x — xo, and it is also reasonable to assume that, the closest x is to 
Xo, the better is such an approximation. 
Therefore, if f is a continuous function, (10.8) suggests that the area function 
A: [a,b] > R is differentiable, with 
Al(G0) = lim SACO) _ i, (LEOAFOD) 
xX—>Xx0 x— Xo xX—>Xx0 2 


= fo), (10.9) 


In words, the heuristic reasoning above suggests that we can recover the 
continuous function f : [a,b] — R out of the corresponding area function 
. + 
A: [a,b] — R. In the case of f(x) = x", for instance, we got A(x) = aay Ne) 
that A’(x) = x”. 
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In modern language, given a nonnegative continuous function f : [a,b] > R, we 
say that the area function A : [a,b] > R is an indefinite integral for f, and write 


A(e) = / " p(Odt. (10.10) 


Here, the symbol re recalls the fact that the area under the graph of f, from a to x, 
was computed with the aid of sums that approximated the desired area better and 
better (/ is a stylized capital S); in turn, the symbol f(¢)dt recalls the fact that the 
summands of those sums had the form m;(t; — t;-1), where m; denoted the smallest 
value of f along the interval [#;_1, t;] (hence, m; = f(t) for some t € [f-1, t;]) and t;-— 
t-1 = At; was the i—th difference along the interval [a, b] (Leibniz systematically 
used the Greek letter A to denote finite differences, and such a notation survived to 
this day). 
Taking (10.9) and (10.10) together, we get 


d x 
& f fod =F00. (10.11) 


and this is essentially the content of the Fundamental Theorem of Calculus, 
which suggests that the operations of integration (i.e., of computing integrals) and 
derivation are somewhat inverses of each another. 

The rest of this chapter puts all of the above in solid grounds, developing several 
interesting applications along the way. 


10.2 The Concept of Integral 


This section introduces the concept of integral for bounded functions f : [a,b] > R 
and establishes the integrability of two important classes of such functions, namely, 
the continuous and the monotone ones. To this end, we begin by fixing some 
notations. 

A partition of an interval [a, b] is the choice of a finite subset 


P= {a =x <x <x) <0 <x =D} (10.12) 


of [a, b]. Given such a partition and a bounded (but not necessarily nonnegative) 
function f : [a,b] > R, we shall systematically denote 


m = inf{f(x); x1 Sx <xfi= inf f 
by-1.4j] 


and 


M; = supif(x); 4-1 Sx < xj} := sup f. 


[yj—14] 


Note that both m; and Mj are well defined, thanks to the boundedness of f. 
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The analogues of the lower and upper approximations for the area of the region 
under the graph of f, discussed in the previous section, are the lower sum and the 
upper sum of f with respect to P, which are respectively defined by 


k k 
s(f; P) => So myx; — xj-1) and Sf; P) = So M(x; — xj-1). 


j=l 


For a fixed partition P, we certainly have m; < M; for every j, so that s(f; P) < 
S(f; P). Hence, also in accordance with the discussion undertaken in the previous 
section, we would like to declare a bounded function f : [a,b] — R as being 
integrable if 


sup{s(f; P); P is a partition of [a, b]} = inf{S(f; P); P is a partition of [a, b]}. 


However, some care is needed, for, the inequality s(f; P) < S(f; P) (which is valid 
when we take the same partition P at both sides) does not necessarily imply a 
relation between the supremum and infimum above. This is due to the fact that 
there is no relation between the generic partition P that declares the set at the left 
hand side above and the one that declares the set at the right hand side (the choice 
of the same letter to represent them was just a matter of notational convenience). 

Thus, before we proceed towards a coherent definition of integrable function and 
integral, we ought to compare s(f; P) and S(f; Q), for two generic partitions P and 
Q of [a, b]. We do that in the coming lemma. 


Lemma 10.1 Let f : [a,b] — R be a bounded function. Given partitions P and Q 
of [a, b] such that P C Q, we have 


s(f;P) < s(f;Q) and S(f;Q) < S(f; P). 


Proof Let P be as in (10.12). We initially consider the case in which Q = P U {x’}, 
with x’ ~ x9,x1,...,x,, and leti € {1,2,...,k} be such that x; < x’ < x;. Then, 


i-l 


SQ) = >) Mj — 4-1) + ( sup AV — x1) 


j=l [i147] 
+ (sup f) (i) + > Mj (xj — xj-1)- (10.13) 
[el j=it1 


Now, since [x;—1, x’], [x’, x;] C [xi-1, x], we get 


sup f, sup f< sup f= Mi, 


Iii’) fx] [i127] 
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so that 


( sup f) (x’ — x1) + ( sup f) (xi — x’) < Mj’ — xi-1) + Mi (x —’) 


[j—1.47] [x’ xi] 
= M,(xj — x-1). 
Hence, it follows from (10.13) that 
i-] k 
S(f;Q) < So Mix; = 1) Mii = %-1) + > Mj (x; — xj-1) 
j=l j=itl 


k 
= SO M(x) —x-1) = S(f: P). 


j=l 


We now consider an arbitrary partition Q containing P. Since Q is a finite set, 
we can pass from P to Q in a finite number of steps, by adjoining to P the points of 
QO \ P, one at a time. In doing so, we obtain partitions P= P; C P2 C---C P} = Q@ 
such that, for 2 < i < J, we get P; by adding a single point to P;_;. Then, the first 
part of the proof gives 


S(f;P) = S(f; Pi) = Sf; Po) = --- = SG; Pd) = Sf; Q). 


Finally, the proof of the inequality involving lower sums is completely analogue 
and will be left to the reader (see Problem 1). oO 

As an immediate consequence of the previous lemma, given a bounded function 
f : [a,b] > R and (any) partitions P and Q of [a, b], we have 


s(f;P) < sf;PUQ) < SF; PUQ) < SF; Q). (10.14) 


In particular, given a partition Q of [a, b], the above inequalities guarantee that 
S(f; Q) us an upper bound for the set {s(f; P); P is a partition of [a, b]} of the lower 
sums of f. Hence, Theorem 7.4 guarantees that such a set does have a supremum, 
which will be denoted sup, s(f; P): 


sup s(f; P) = sup{s(f; P); P is a partition of [a, b]}. 
P 


On the other hand, since the supremum of a set bounded from above is the smallest 
of its upper bounds (one of which is S(f; Q)), we conclude that 


sup s(f; P) < S(f; Q). 
P 
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However, since the partition Q was arbitrarily chosen, we conclude that this last 
inequality is valid for every partition Q of [a,b]. To put in another way, the real 
number supp s(f; P) is a lower bound for {S(f; Q); Q is a partition of [a, b]}, the set 
of the upper sums of f. Hence, it follows from Problem 6, page 206, that such a set 
has an infimum, which will be denoted by infg S(f; Q): 


int Sf; Q) = inf{S(f; Q); Q is a partition of [a, b]}. 


Finally, since the infimum of a set bounded from below is the largest of its lower 
bounds (one of which is supp s(f; P)), it follows that 


sup s(f; P) < inf Sf; Q). (10.15) 
P 


We are finally in position to present the central definition of this chapter. 


Definition 10.2 A bounded function f : [a,b] + R is Riemann integrable! if 
sup s(f; P) = inf S(f; P). 
P P. 


As we have anticipated at the first paragraph of this section, continuous and 
monotone functions with domain [a, b] are integrable. Nevertheless, with what we 
have at our disposal at this point, it is more convenient to start by presenting the 
classical example of a bounded nonintegrable function. 


Example 10.3 Recall from Problem 11, page 255) that Dirichlet’s function is f : 
[0, 1] > R such that 


_ (0ifx¢Q 
i 


Fixed a partition P = {0 = x9 < x1 < x2 <-+- < x, = 1} of [0, 1], Problem 4, page 
206, assures that all of the intervals [xj-1 ‘ xj] contain rational and irrational numbers. 
Therefore, for 1 <j < k we have m; = 0 and M; = 1, so that 


k 
s(f; P) = Se myx; —xj-1) =0 


j=l 


‘When we look at large to Riemann’s legacy to Geometry and Analysis, we easily come to the 
conclusion that they largely surpass the adequate formalization of the concept of integral that bears 
his name. (As the attentive reader has certainly noticed, this chapter is entitled Riemann’s Integral!) 
Indeed, Semi-Riemannian Geometry gave Albert Einstein the adequate theoretical apparatus for the 
development of General Relativity. Other of Riemann’s creations, known today as Riemann’s zeta 
function, has shown to be an ubiquitous object in Number Theory. 
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and 
k k 
S(f; P) => So M(x; — xj-1) => YG — xj-1) = 1. 
j=l j=l 


Then, 


sup s(f;P) =O0<1= inf S(f; P), 
P 


and f is nonintegrable. 


In order to discuss the integrability of functions in a more transparent way, it is 
worth to recast the definition of integrable function as in the coming result, which is 
known as Cauchy’s integrability criterion. 


Theorem 10.4 (Cauchy) A bounded function f : [a,b] — R is integrable if and 
only if the following condition is satisfied: for every € > 0, there exists a partition 
P. of [a, b] such that 


S(f; Pe) — s(f; Pe) <€. (10.16) 


Proof Firstly, suppose that f is integrable, so that supp s(f; P) = infp S(f; P). Let 
this common value be denoted by /, and let € > 0 be given. Since J = supp s(f; P) 
is the least upper bound for the set of lower sums of f and J — 5 < J, there exists a 
partition P; of [a,b] such that J— 5 < s(f; P:). Analogously, since J = infp S(f; P) 
is the greatest lower bound for the set of upper sums of f and/ + 5 > J, there exists 
a partition P2 of [a, b] such that S(f; P2) < 1 + 5. Letting Pe = P, U Py, it follows 
from (10.14) that 


Sf; Pe) — s(f; Pe) < S(f; Pa) — s(f: Pi) < (1+ =) —(1- =) = 


Conversely, assume that the stated condition is satisfied. Then, given € > 0, and 
taking a partition P, as in the statement, it follows from (10.15) that 


0 < inf S(f; P) — sup s(f; P) S S(f; Pe) — s(f; Pe) <€. 
P 


However, since € > O was arbitrarily chosen, it follows that infp S(f;P) — 
supp s(f; P) = 0, as wished. Oo 


We can finally establish the integrability of monotone and continuous functions. 
Theorem 10.5 Every monotone function f : [a,b] — R is integrable. 


Proof Suppose f to be nondecreasing, the case of a nonincreasing function being 
totally analogous. Then, the image of contained in the interval [f(a), f(b)], so that it 
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is clearly bounded. Let € > 0 be given. By Cauchy’s integrability criterion, in order 
to establish the integrability of f it suffices to find a partition P = {a = x) < x < 
+++ < xy = b} of [a, b] such that condition (10.16) is satisfied for P. = P. 

Given a partition P as above, f being nondecreasing implies 


m, = inf{f(x); xj-1 < x < x} =f@-1) 


and, analogously, M; = f(x;). Hence, 


k k 
Sf; P) — s(f;P) = > Mj@j— 4-1) — Dm — 4) 


j=l j=l 


k 
= YF) — f Xj-1)) Qj — 4-1). 


j=l 


Letting 6 = max{x; — xj-1; 1 <j < k}, the last equality above and (3.15) give 


k 
Sf; P) — s(f:P) < °F) — F@j-1))5 = 8) —F(@)). 


j=l 


Therefore, it suffices to choose P such that 6(f(b) — f(a)) < €, which happens 


p ; 
whenever 6 < FD=a@H for instance. oO 


Theorem 10.6 Every continuous function f : [a,b] > R is integrable. 
Proof Corollary 8.25 guarantees that such an f is bounded. Hence, again by 
Cauchy’s integrability criterion and given € > 0, in order to establish the 
integrability of f it suffices to find a partition P = {a = x9 <x) <--- <x, = b} of 
[a, b] such that (10.16) is satisfied for P. = P. 

As in the proof of the previous of the previous result, 


k k 
S(f:P) — s(f;P) = > Myx; — 4-1) — D- myj — 4)-1) 
j=l j=l 


: (10.17) 
= )° (Mj — m)@ - 4-1). 


j=l 


Now, Theorem 8.24 assures that f is uniformly continuous. Therefore, according 
to Definition 8.22, given <’ > 0 there exists 6 > 0 such that 


x,y € [a,b] and |x —y| < 6 > [f@®) -—f6)| < €. 
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Thus, starting with a partition P such that xj — xj) < 6 for 1 < j < k, we 
conclude that 


Hu Smy <> be-yl <b > F@-SO) <e’. 
However, since Mj = sup{f(x); xj-1 < x < xj} and m; = inf{f(y); 4-1 < y < xj}, 
it easily follows from the above inequality that 

xj -Xj-1 <b > Mj-—m <€, Vi<jx<k. (10.18) 


Finally, letting P be a partition such that x; — x1 < 6 for 1 <j < k, it follows 
from (10.17), (10.18) and (3.15) that 


k 
S(f:P) — s(f;P) = S° (Mj — mj) (qj — 4)-1) 


j=l 


k 
as > €! (x; — xj-1) = e'(b _ a). 
j=l 
Therefore, in order to get S(f; P) — s(f; P) < «€ it suffices to start with <’ = Wa" 
As anticipated in the end of the previous section, if f : [a,b] > R is an integrable 
function, we let 


b 
/ f(xjdx (10.19) 


denote the integral of f on the interval [a,b]. Note also that it does not matter 
whether we denote the integral of f on [a, b] as above or writing q : f (dt. Indeed, 
such a change in notation is equivalent to changing the notation for the independent 
variable of f, which certainly does not alter the value of the integral. 

In spite of what we have done so far, unfortunately we do not have yet a general 
procedure for computing integrals of (integrable) specific functions f : [a,b] > R. 
We shall remedy this situation in Sect. 10.5. However, see Problems 2, 4, 5 and 10. 


Problems: Section 10.2 


1. * Complete the proof of Lemma 10.1 showing that, if f : [a,b] > R is bounded 
and P and Q are partitions of [a, b] such that P C Q, then s(f; P) < s(f; Q). 

2. * If f : [a,b] — R is a constant function, say f(x) = c for every x € [a, b], 
show that 


b 
i f(x)dx = c(b— a). 
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3. Let f : [a,b] — R be a monotone function and Py = {a = x9 < x1 <--- < 
xx = b} be a uniform partition of [a,b], ie., such that xj — 4-1 = boa for 
1 <j < k. Show that 


b 
/ flax = lim s(f:Pe) = lim Sif: Pe). 


4. * Given a nonnegative function f : [a,b] — R, check that the approxima- 
tions (10.1) and (10.2) for the area of the region under the graph of f are 
particular cases of lower and upper sums of f. Then, use this fact, (10.6) and the 
result of the previous problem to show that, given n € N and b > 0, we have 


b prt 
/ x"dx = : 
0 n + 1 


5. In order to compute ie ./x dx, do the following items: 


(a) Show that, for reals « > v > 0, one has 
uVit— vy) — (uJ V0) < SJE + VOU 0) 
and 
a (Vi + JIU 0) < Fut vv). 


(b) If Py = {a = x9 < xy < +++ < x, = b} is a uniform partition of [a, b], use 
the result of (a) to show that 


ong < s(x; Px) + sd vb va 


and 
s( Py) + EN ee zovb- av/a). 


(c) Conclude that [? /xdx = 2(bVb — ava). 


6. Assume that i Tai 2 — 1). (We shall compute this integral in item 
= ! oe ee ee ee 
(d) of Problem 1, page 397.) If ay = p ( sie tee tot ==) for 


n €N, show that (a,),>1 converges to 2(/2 — 1). 
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a 


11. 


We shall compute i = dx = 4 in Example 10.42. Use this fact to show that, 
ifa, =n (ote ae awe eee ee ait), then the sequence (ay)n>1 converges 
to 4 z 


: Let [a, b] > R be a bounded function and a > 0 be given. If for each k € N 


there exists a partition P; of [a, b] satisfying S(f; Px) — s(f; Px) < ¢, show that 
f is integrable, with 


b 
/ f(xjdx = im, s(f; Px) = jim. S(f; Px). 


For the coming problem, the reader might find it convenient to read again 
the statement of Problem 3, page 262. 


. Let f : [a,b] — R be a Lipschitz function, with Lipschitz constant c, and let 


Py = {a = X09 < xX) < +++ <x, = b} be a uniform partition of interval [a, b]. 


(a) Show that S(f; Px) — s(f; Px) < nae 
(b) For 1 < j < k, choose a real number &j € [xj;-1,xj] and define the sum 


Xf: Pes &) by UF; Pes &) = Die SE) Oy — xj;-1). Prove that 


b 
f(xjdx = lim X(f: Py: &). 
4 k—>+00 


. * Do the following items: 


(a) Given a,h € R, with h ¥ 2/7 for every / € Z, prove that 


k sin (a + cout) sin erpe 
Ys sin(a + jh) = > >? 
‘0 sin 5 

and 

k cos (a ae (k—- cout) sin KEDA erDh 
Y -cos(a + jh) = a ee ee 
: sin 5 
j=0 


(b) Show that i sinx dx = cosa —cosb and i cos x dx = sinb — sina. 
Let f : [0, 1] ~ R be given by 


ifx=Oorx€Q 


m 


if x = 7 with m,n € N and ged(m,n) = 1 © 


fx) = 14 4 


Prove that f is integrable. 
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10.3. Riemann’s Theorem and Some Remarks 


In this section we state and prove an important theorem of Riemann on the 
characterization of Riemann integrable functions, as well as make some other 
important remarks on the theory of integration we are developing. In particular, we 
rigorously define the concept of area for the region under the graph of a nonnegative 
integrable function, which was informally discussed in the first section of the 
chapter. 

I. RIEMANN’S THEOREM. Usually, the presentation of the concept of integral in 
Calculus courses makes use of the concept of Riemann sums. More precisely, given 
a bounded function f : [a,b] — R, one defines f as being (Riemann) integrable 
if there exists a real number / (the Riemann integral of f) satisfying the following 
condition: given € > 0, there exists 6 > 0 such that, for every partition P = {a = 
Xo < xX} < +++ < xy = b} of [a,b] and every choice of points & € [xj-1, xj], 
1 <j <k, one has 


k 
max{|x; x1); 1<j<k}<é6> | FE —H)—1] <€. (10.20) 
j=l 


In the above notations, we say that max{|x; — xj-1|; 1 <j < k} is the norm of 
the partition P and that at (§&) (aj — xj-1) is the Riemann sum of f with respect 
to P and to the chosen intermediate points § = (&;)1<j<,. Letting 


k 
|P| = max{|aj — aj]; 1s js kj and E(f;P3E) = DFE) OG — x1), 
j=l 


we summarize (10.20) by saying that J is the limit of the Riemann sums X(f; P; €), 
when |P| — 0 and for every choice § = (&;) of intermediate points relative to P. 
Moreover, in this case we simply write 


I= lim X(f;P;&). 
|P|>0 


The following result shows that the above definition of integral coincides with the 
one we have adopted in the previous section. The proof of it can surely be omitted 
on a first reading. 


Theorem 10.7 (Riemann ) A bounded function f : [a,b] — R is integrable (in 
the sense of the previous section) if and only if the limit limp|+9 X(f; P; €) of its 
Riemann sums exists, does not depending on the chosen intermediate points of &. 
Moreover, in this case one has 


b 
[ feva= lim X(f; P; &). 
a |P|>o 
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Proof We first assume that f is integrable in [a, b], with [? fxdx =1. 

Given € > 0, we want to find 6 > 0 such that, for every partition P of [a, b], it 
is the case that |P| < 6 > |X(f;P;&) —1| < ¢, for every choice & of intermediate 
points. To this end, start by choosing (by Cauchy’s integrability criterion) a partition 
Po = {a = X0 < x1 <... < xx = D} of [a, b] such that S(f; Po) — s(f; Po) < 4. 

For a general partition P = {a = yo < y) <... < y, = b} of [a, D], there are two 
different kinds of intervals (y;—1, y;): those which contain at least one of the x;’s and 
those which do not. 

If we choose 0 < 6 < 5|Pol. then |P| < 6 implies y; — y;-) < 5 (3; — xj-1) for all 
1<i<J,1 <j <k. Therefore, each interval (y;1, y;) contains at most one of the 
x;’s and each interval (x;—-1,x;) contains at least three of the y;’s. This way, we can 
write 


D(f; Ps&) = U'(f: P:&) + Uf; P: 8), 


where &/(f; P;&) (resp. X=” (f; P; €)) denotes the collection of those summands of 
X(f; P; £) corresponding to indices i such that (y;-1, yi) N Po # O (resp. (yi-1, ¥’1) N 
Po = 9). 

Let € = (&,...,&) be an arbitrary choice of intermediate points for P, and 
observe that 


|D(f; Ps €) — 1] < (Ds Ps) + |Z" Ps) - I. (10:2) 


We now let M = supy,p) [f| and estimate each of the summands at the right hand 
side above. 


(i) Estimating |&’(f; P; &)|: the restriction imposed on 6 forces X/(f; P; €) to have 
at most k—1 summands (at most one for each of x1, ..., x,-1). Since |f(&;)| <M 
and y; — y;-1 < 6, we get 


ID'S Ps 8)| = LE FEIOi — y-) S VED Oi — yi-1) < Mb(K— 1). 
(ii) Estimating |=” (f; P; €) — I|: we start by noticing that 
|=": P; &) —1| < |B"; Ps &) — s(f; Po)| + |s(fs Po) — Z| 


<|E"(f:P:8) — s(fPo)| + 5. (10.22) 


so we are left to estimating |=” (f; P; €) — s(f; Po)|. To this end, first recall that 
if 1 < i < lis one of the indices that fall into =”, then (y;-1, y;) does not 
contain any of the x;’s, but there is exactly one index 1 < j < k such that 
[vi-1. yi] C [xj-1, xj]. Hence, we can write 


Ba" P3 8) = DG PB) + + DEF Ps §), 
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where &''(f; P; €) stands for those summands of &”(f; P; ) corresponding to 
indices | < i </ such that [y;-1, yi] C [xj-1, xj]. Therefore, if mj = infpy_saxjS, 
then the triangle inequality gives 


k 


|5”(f P: 8) — s(f Po)| < D> 1D) Gs P: &) — my — x-0)1. 


j=l 


In order to deal with this last sum, fix 1 < j < k and let | < i < / be such that 
Litt, yi] C [xj-1, x]. Recall that our choice of 6 forces [x;-1, xj] to contain at least 
another point of P. Therefore, if r; < i— 1 ands; > i are respectively the least and 
greatest indices such that (V5 Ys;) C (4-1, %;), then s; — rj => 2 and we have 


Yry-t < Xr Sy <0 Sie SiS ee SVs SXF -< V5j41- 


This way, and letting S; = |X/(f; P; §) — mj(xj — xj-1)| and Mj = SUPE; xj) f> WE 
estimate S; by writing —— 


Sj 


§ =| 2 FEO - 1) = my —)-— DE mOr-y-) 


t=r+1 ttl 


A 


— m(Xj — ys;) 


S> FE) = mile = y=1) + [mn Vy, — 5-1) + Lin OG — Ys) 


t=rjt+1 


IA 


IA 


sj 
ye (Mj — mj) — yi-1) + 2|mj|d 


t=rj+1 
(Mj — mj) (Ys; — Yrj-1) + 2MB 
< (M; _ my) (xj — Xj-1) + 2M6, 


IA 


where we used the facts that |f(§) — mj| < M; — mj, 0 < y,, — xj-1,.xj — Ys; < 6 and 
Ys; —Yy-1 S Xj — Xj-1- 
Hence, we get from (10.22) that 


k 


k 
|"; P; &) — s(f; Pol < D5 5; < D6 (Mj — mj) (qj — 5-1) + 2M) 


j=l j=l 


= S(f; Po) — s(f; Po) + 2Mbk < : + 2M8k. 
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Finally, gathering together (10.21), (10.22) and the estimates of items (1) and (ii), 
we atrive at 


[=(f; P: £) —I| < = + MB(3k— 1). 


Therefore, if we further ask that 0 < 5 < we get 


a 
IM (Bk—1)” 


1 
IP| < min {5|Pol. = |X(f: Ps) —I| <e, 


€ 
2M (3k — 1) 
as we wished to show. 
The proof of the converse is left to the reader as an exercise (see Problem 9). O 


In spite of the fact that the introduction of the integral by means of Riemann 
sums was the actual way the theory historically evolved, the approach by means of 
lower and upper sums makes it technically much easier to deal with the material of 
the next section. Nevertheless, as we shall see right now, the characterization of the 
integral by Riemann’s theorem allows us to deal with the heuristic discussion of the 
previous section (concerning area computations) in a mathematically precise way. 

Given a bounded and nonnegative function f : [a,b] — R, let R denote the 
region under the graph of f (and above the horizontal axis and between the vertical 
lines x = a and x = b): 


R= {(x, y) € R?3a<x<band0<y<f(x}. 
Let Py = {a = x9 < x) < +++ < xg = b} bea uniform partition of [a, b], i.e., such 
that x; X11 = oa for 1 <j < k. 
If f is continuous, then Theorem 8.26 guarantees that, for 1 < j < k, there exist 
Ej, Ej € [xj-1, xj] such that 


f(y) = a and f(y) = ome 


Hence, the lower and upper approximations A(f;k) and A(f;k) for the area of R 
(respectively defined by (10.1) and (10.2), page 348)) respectively coincide with the 
Riemann sums X(f; Px; &) and U(f; Px; &) (here, note that & = (&j) and & = 
(&;). However, since |P;| = i it follows from Riemann’s theorem that 


b 
/ f(xdx = Jim Xf; Pri &) = eae k) 


and, analogously, 
b — 
[ fevae= lim A(f;k). 
a k+>+00 


Thus, is J (x)dx is the only reasonable value for the area of 7. 
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On the other hand, if f is merely integrable (albeit also nonnegative), then, except 
for changing the minimum and maximum values of f along [x;-1, x;] by its infimum 
and supremum along such an interval, A(f;k) and A(f;k) also reduce to the lower 
and upper sums s(f; P;) and S(f; P,), respectively. 

We sum up the above discussion in the following 


Definition 10.8 Let f : [a,b] > R be a nonnegative integrable function. We define 
the area of the region R under its graph as 


b 
A(R) = | f(xax. 


In the next section, the above definition will give heuristic arguments towards the 
validity of some of the operational properties of Riemann’s integral. 

II. LEBESGUE’S THEOREM AND INTEGRAL. In Theorem 10.6, we proved 
that every continuous function f : [a,b] — R is integrable. We also saw, in 
Example 10.3, that the Dirichlet function (which, by Problem 11, page 255, is 
discontinuous at every point of the interval [0, 1]) is not integrable. On the other 
hand, we shall prove in the next section (cf. Proposition 10.19) that a piecewise 
continuous function f : [a,b] > R (.e., a function with a finite number of points 
of discontinuity x9 € [a,b], such that for all of them the lateral limits lim,—,.+ f(x) 
do exist) is still integrable. A more dramatic example is given by the function of 
Problem 11, page 359, which is integrable but has an infinite number of points of 
discontinuity. 

The last paragraph suggests that there might be a more intimate relation between 
the integrability of a bounded function f : [a,b] — R and the size of the set Dy of 
its points of discontinuity. This is indeed the case, and a precise formulation of such 
a connection requires the concept of a set of measure zero, which we now give. 


Definition 10.9 We say that X C R is a set of (Lebesgue”) measure zero, or a 
null set if, given € > 0, there exist open intervals 1), Jo, Is, ... such that 


XC\|Jy and DOU <e, 


jel jel 


where |J;| stands for the length of J}. 


The following result links the concept of null set to that of Riemann integrable 
functions. 


Theorem 10.10 (Lebesgue) A bounded function f : [a,b] — R is Riemann 
integrable if and only if the set Dy of its points of discontinuity has measure zero. 


? After Henri Lebesgue, French mathematician of the XX century. 
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We shall not prove Lebesgue’s theorem here, and refer the interested reader to 
[1, 9] or [20]. The reason is that, apart from the Problem 4 (which will be obtained 
by more elementary methods in the coming section) and Problem 5, page 468 
(for which we present two proofs, one of which does not make use of Lebesgue’s 
theorem), we will not use it anywhere else in the book. Nevertheless, it is important 
to realize that it gives a unified explanation for the status, with respect to integration, 
of all of the functions of the next to last paragraph. Indeed, since the set of points of 
discontinuity of a continuous (resp. piecewise continuous) function is empty (resp. 
finite), hence of measure zero, such functions are integrable. On the other hand, it 
is possible to prove that Q is a null set, while [0, 1] is not (for the case of Q, see 
Problem 2); this explains why the function of Problem 11, page 359 is integrable 
(for, its set of points of discontinuity is Q /N [0, 1], which—being a subset of Q—is 
certainly a null set), as well as why the Dirichlet function is not integrable (for, its 
set of points of discontinuity is the whole interval [0, 1]). 

Let I denote the set of irrational numbers. It is possible to prove that the set 
IN [a, b], of the irrational numbers of the closed interval [a, b], is not a null set. 
Hence, Lebesgue’s theorem assures that a function f : [a,b] — R whose set of 
points of discontinuity is IN [a, b] is not integrable. However, as we stressed before, 
such a function does not exist (a proof can be found in [1], for instance). 

Lebesgue’s contributions to integration theory go far beyond Theorem 10.10. At 
the dawn of the XX century, he introduced a much more refined and flexible concept 
of integral in his doctor’s thesis. In order to get a glimpse of it, we shall first of all 
need the following 


Definition 10.11 The characteristic function of a set A C R is the function 4 : 
[a, b] > R such that 


1, ifxeA 
Xa) = 40 . 
AM = 19 ity A 
Now, let f : [a,b] — [0, L] be a nonnegative bounded function. In the context of 
Riemann’s integral, we start by choosing a partition P = {a = x9 < x1) <-+: < 
xx = b} of the domain [a,b] of f. Then, we define the step functions fp_,fp.+4 : 
[a, b] > R by letting 


k k 
P- = So mis] and fp, = So MAX, 29) 


j=l j=l 


where mj = inf, xf and Mj = supy,._,.)f- In the next section, we shall show 
(cf. Example 10.20) that such functions are Riemann integrable, with 


b k b k 
[feta = Pmt) and [for (eyae = Mio). 


j=l j=l 
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Thus, in accordance with the previous section, f is Riemann integrable if 


b b 
up - Sp —(x)dx = inf i. Tp.+(x)dx; 


moreover, in this case the integral of f coincides with this common value. 

On the contrary, in the realm of Lebesgue’s integral, we start with a partition 
Q = (0 = yw < y <-+: < y = L} of the codomain [0, L] of f. Then, for 
1 <j < k—1 we take the inverse image A; of the interval [y;_1, y;) by f, which is 
defined by 


Aj = {x € [a,b]; FQ) € Dj. yp} 


we also consider the inverse image A, of the interval [y,—1, yx], such that 


Ay = {x = la, b]; f(x) € [ye—1, Ye] }- 


If these sets are not too complicated (in a sense that doesn’t interest us at this 
moment), one can show that it is possible to associate to A; a nonnegative real 
number m(A,), which we call the Lebesgue measure of A;. Moreover, this is done 
in such a way that if A; is an interval then m(Aj;) coincides with the length of Aj; in 
this case, we say that A; is a (Lebesgue) measurable set. 

Assuming that all of the Aj’s are measurable, we consider the simple function 


k 
fo= So yj Xa, 


i=l 


and define its Lebesgue integral, denoted Jia b] to, by 


k 
fo = > yy-1m(A)). (10.23) 
[a,b] j=l 
Finally, if for every partition Q of [0, L] the resulting sets A; are measurable, 
then we say that f is a measurable function; if this is so, we define the Lebesgue 
integral of f, denoted Sia wf by 


f = sup | fo Q is a partition of 0.1] : (10.24) 
[a,b] Q 


Although we have “defined” the Lebesgue integral just for nonnegative bounded 
functions, it is possible to consider the notion of Lebesgue integral in the context 
of bounded measurable (but not necessarily nonnegative) functions f : [a,b] > 
R. (Actually, we could consider the even more general case of an unbounded 
measurable function, but this will not play a role here). 
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It is possible to show that the Lebesgue integral is more comprehensive than 
the Riemann integral, in two ways: on the one hand, every (bounded) Riemann 
integrable function f : [a,b] — R is also Lebesgue integrable and the values of 
both integrals of f coincide; on the other, there exist bounded functions f : [a, b] > 
R which are not Riemann integrable but are Lebesgue integrable (the Dirichlet 
function is one such—see Problem 5). 

Moreover, as we shall comment on Sect. 11.2 (cf. Remark 11.15), Lebesgue 
integration also has at its disposal a number of convergence results whose proofs 
are inaccessible in the context of Riemann integration, but which reveal themselves 
to be central tools for the study of the deeper properties of sequences and series 
of functions. Actually, such results are one of the main reasons behind the fact that 
Lebesgue integration is more adequate than Riemann integration for many purposes 
in Mathematics and its applications. To name one relevant example, it is completely 
indispensable to the modern study of Partial Differential Equations and Differential 
Geometry. 

If this is so, the reader might ask why not to study the Lebesgue integral in 
advance, relegating the Riemann integral to a museum. The reason is that, on the 
one hand, in spite of its greater flexibility, an adequate presentation of the concepts 
needed to develop Lebesgue’s notion of integral is considerably more complicated 
than what we have done so far to the Riemann integral (in (10.24), the sets A; may 
be very complicated); on the other hand, and as we shall see later in this chapter, 
Riemann’s integral suffices to the discussion of several interesting problems. 

For the interested reader, we suggest the references [20] or [27] for rather 
elementary introductions to the Lebesgue measure and integral. 


Problems: Section 10.3 


1. In item (c) of Problem 1, page 397, we shall show that be xsin(ax)dx = +. 


Use this fact to show that, if a, = rat a sin (), then (a,)n>1 converges 
and limy,++o0 dy = |. 

2. * Show that Q is a null set. 

3. Prove that if A,,A2,... C R are null sets, then so is Ujs1 Aj. 

4. Use Lebesgue’s theorem to prove that if f,g : [a,b] — R are Riemann 
integrable, then fg : [a,b] > R is also Riemann integrable. 

5. Show that Dirichlet’s function f : [0,1] — R (cf. Example 10.3) is Lebesgue 
integrable, with Sof = 0. 

6. Iff : [a,b] — [0, L] is a monotone function, we have showed in Theorem 10.5 
that f is Riemann integrable. Show that f is also Lebesgue integrable, with 


f? fQddx = dead: 
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7. (Berkeley) Let f : [0,+00) — [0,+00) be a continuous and increasing 
bijection. Show that 


a b 
/ Fade + i: Fal Sob: 
0 0 


for every positive reals a and b. 
8. (Leningrad) Let f,g : [0,1] — [0,1] be continuous functions, with f 
nondecreasing. Prove that 


/ ¢s0Oe= i “flac + / Pore 


9. * Complete the proof of Riemann’s theorem. 
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This section is devoted to the derivation of some useful operational properties for 
the Riemann integral. In spite of the fact that such properties will be extensively 
used along the rest of the book, their proofs can be omitted in a first reading, with 
essentially no loss of continuity. 

Along all of this section, given bounded functions f,g : [a,b] — R anda 
partition P = {a = x) < x) < +++ < xg = b} of [a,b], we shall denote the 
infimums of f and g in [x;_1, xj] by m;(f) and m;(g), respectively; accordingly, Mj(f) 
and M;(g) will denote the supremums of f and g in the same interval [x;_1, xj], also 
respectively. 

If f,g : [a,b] — R are nonnegative integrable functions, with f < g, then the 
regions Ry e R,, respectively situated under the graphs of f and g, are such that 
Rr C Rg; we therefore expect that A(R;) < A(R,). This is indeed the case, and 
we usually refer to it by saying that the Riemann integral is monotonic. The general 
case is as follows. 


Proposition 10.12 If f,g : [a,b] — R are integrable functions such that f < g, 
then Sf? fQddx < ti g(x)dx. 


Proof Letting P = {a = x9 < x1 < +++ < x, = b} denote a partition of [a, D], it 
follows at once from f < g that Mj(f) < M,(g) for 1 <j < k. Hence, 


k k 
Sf: P) = > Myf) (4 — 1) < D> Mig) (j — 4-1) = S(e: P). 


j=l j=l 


However, since : ff (x)dx < S(f; P), we conclude that 


b 
/ fxddx < S(g: P), 
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for every partition P of [a,b]. Thus, iF (x)dx is a lower bound for the set of the 
upper sums S(g; P), so that 


b b 
/ f(x)dx < inf S(g: P)= / 2(x)dx. 


oO 


For what comes next, we consider again a nonnegative integrable function f : 
[a,b] — R and a positive real number c. Problem 10, page 194, guarantees that 
the region ®,¢ under the graph of the function cf can be obtained from the region 
Ry under the graph of f by vertically stretching Ry by a factor c; we thus hope that 
A(Ref) = cA(Re). On the other hand if g : [a,b] — R is another integrable and 
nonnegative function, then, for each x9 € [a,b], the segment of the vertical line 
xX = Xo contained in Ry+, (the region under the graph of f + g) has length equal 
to the sum of the lengths of the segments of such a line which are contained in Ry 
and R,; thus suggests that Ry+¢ can be obtained by glueing Ry right above Ry and, 
hence, that we should have A(Ry+.) = A(Ry) + A(Re). 

The next result shows that the two properties of the Riemann integral hinted by 
the heuristic reasonings above are actually true. From now on, we shall refer to these 
properties by saying that the Riemann integral is respectively linear and additive. 


Proposition 10.13 Let f,g : [a,b] > R be integrable, and let c € R. Then: 
(a) cf : [a,b] > R is integrable, with p cf (x)dx = cf fax. 
(b) f+g: [a,b] > Ris integrable, with ¢@+g@)ax = Sf? fodx+f? g(x)dx. 


Proof (a) As in the proof of the previous proposition, let P = {a = x9 < x1 < 
+++ <x, = b} bea partition of [a, b]. 
If c > O, it is immediate to verify (see Problem 12, page 207) that 


mcf) = inf { = ar inf J =cm(f) 


PF 1 oy G1 AG 
and, analogously, M;(cf) = cM;(f). Thus, 
k k 
S(cf; P) = > m,(cf) (xj — xj-1) = ce, m(f) (aj — x-1) = cs(f; P) 
j=l j=l 


and, analogously, S(cf;P) = cS(f;P). The integrability of f and the elementary 
properties of the concepts of supremum and infimum (see Problem 12, page 207, 
again) give us 


b 
sup s(cf; P) = sup(cs(f; P)) = c sup s(f; P) = cf Sf (x)adx 
P P P a 
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and 
b 
inf S(cf; P)= inf(c S(f; P))=c inf s(f; P)= cf S(x)dx. 


If c < 0, the deduction of the two relations above is entirely analogous; one just 
needs to observe that mj(cf) = cM,(f), Mj(cf) = cmj(f), s(cf; P) = c S(f; P) and 
S(cf; P) = cs(f; P), so that 


b 
sup s(cf; P) = sup(c S(f; P)) = c inf S(f; P) = cf ff (x)dx 
P P sa a 


and, in the same way, infp S(cf; P) = c f? f@dx. 


In any case, we have infp S(cf; P) = supp s(cf; P) = cf fax. Therefore, cf 
is integrable, with 


b b 
/ cf (x)dx = int S(¢f; P)= a f(xdx. 


(b) Let P and Q be partitions of [a, b] such that PU Q = {a = x9 < x1 < ++: < 
x; = b}. It readily follows from Problem 13, page 207, that 


sup (f+g)< sup f+ sup g, 


[yj—-1.4)] [yj—-1 4] [xj—1 4] 
or (with respect to the partition P U Q) Mj(f + g) < M,(f) + Mj(g). Hence, 


k 
Sf +g:PUQ) => Mf + 8)04— 5-1) 


j=l 


k 
< J \(Mi(f) + Mi(g)) (xj — 4-1) 


j=l 
= S(f; PU Q) + S(g; PU Q) 
< Sf; P) + S(g; Q). 


However, since infy S(f + g;R) < S(f + g; PU Q), the inequalities above give 
inf S(f + g:R) < S(f; P) + S(g: 9), 


for all partitions P, Q and R of [a, b]. 
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In the last inequality above, taking the infimum over all partitions P and Q of 
[a, b] and invoking Problem 13, page 207, again, we get 


inf S(f + g;R) < inf{S(f; P) + S(g; Q); P, QO partitions of [a, b]} 
= inf S(f; P)+ inf S(g; Q) 
b b 
= | f@dx+ / g(x)dx. (10.25) 
Arguing as we have done up to this moment, we successively get mj(f + g) = 


m,(f) + mj(g), sf + g; PUQ) = s(f; P) + s(g; Q) and sup, s(f + g:R) > s(f: P) + 
S(g; Q). Hence, by using Problem 13, page 207, yet another time, we obtain 


sup s(f + g;R) = sups(f; P) + sup s(g; Q) 
R P Q 


b b 
= / Fore / Gide (10.26) 


Finally, since supp s(f + g;R) < infrS(f + g:;R), it follows from (10.25) 
and (10.26) that 


b b 
inf S(f +g:R)= sp s(f + g;R) = / F(x)dx + / g(x)dx. 
Thus, f + g is integrable, with [’ (f(x) + g(x))dx = [’f@dx+ f? @@dx. 


An obvious corollary of the previous proposition is that, if f, g : [a,b] > R are 
integrable functions, then f — g : [a, b] — R is also integrable, with 


b b b 
(FG) = 9(a))ax = f fear — | g(x)dx. (10.27) 


Indeed, item (a) guarantees the integrability of —f, whereas item (b) guarantees that 
of f — g = f + (—g). Relation (10.27) now follows from the formulas of items (a) 
and (b) of the previous proposition: 


i (Orie : "fO)de+ i ‘Ca@iae 


b b 
=a feyax+ (1) f g(x)dx. 


If f : [a, b] — R is a continuous function, the chain rule assures that |f| = |-|of : 
[a, b] > R is also continuous; in particular, |f| is also integrable. In what follows, 
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we establish the integrability of |f| by supposing that f is only integrable. We also 
obtain a quite useful inequality relating the integrals of f and |f|, which is known as 
the triangle inequality for integrals. 


Proposition 10.14 [ff : [a,b] — R is an integrable function, then the function 
[f| : [a,b] > R is also integrable, and the following inequality holds: 


/ : f(xdx 


Proof Let f+,f— : [a,b] — [0, +00) be given by 


b 
s / If (x) |dx. (10.28) 


f+ (x) = max{f(x), 0} and f-(@) = — min{f(x), 0}. 


It is immediate to verify that f = ff, —fL and |f| = f4 +L. Hence, if 
f+ is integrable, then the discussion immediately subsequent to Proposition 10.13 
guarantees that f_ = f; —/f is also integrable; however, this being the case, item (b) 
of that proposition assures that the same holds for |f| = f4 +/fL. 

Now, in order to prove (10.28), note first of all that the monotonicity of the 
integral, together with the fact that f- > 0, gives lies (x)dx > 0. In turn, by 
successively applying (10.27), this inequality and the formula of item (b) of the 
previous proposition we get 


i ; f(xdx = / ; fi dx — : ; f-(@)dx 


b b 
< / fears | fax 


b 
= i (fi (dx + fx 
b 
a / Flax. (10.29) 


(For another proof of (10.28) we refer the reader to Problem 7.) Analogously, 


b b b b 
= i fade = i (Nixdx < / (A @)ldx = / IF) dx, 


so that 


a) F@)|dx < [reves < [ FG) |e, 


which is equivalent to (10.28). 
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We are left to showing the integrability of f;. To this end, let P = {a = x9 < 
Xy <+++ <x, = b} be a partition of [a, b]. If f(x) > 0 for some x € [x;-1, x], then 
M;(f+) = M,(f); on the other hand, from f, > f we get m;(f+) = m,(f), so that 


Mj(f+) — m(f+) < Mj(f) — m(f). 
If f(x) < 0 for all x € [xj-1, x], then fy = 0 in [xj-1, x;] and, then, 
Mj(f+.) — m(f-) = 0 < M,(f) — m(f). 


In any case, we have Mj(f+) — mj(f+) < Mj(f) — m,(f), so that 


k 
S(f:P) — s(feiP) = So (Mi(f+) — m;(F+)) yj — ¥j-1) 


j=l 


k 
< S (Myf) — m(f))@j — 4-1) 


j=l 
= S(f; P) — s(f;P). 


Now, since f is integrable, Cauchy’s criterion for integrability assures that, given 
€ > 0, we can choose the partition P in such a way that S(f; P) — s(f;P) <e. 
However, this being the case, it follows from the last inequality above that S(f_; P)— 
s(f;;P) < e. Therefore, again by Cauchy’s criterion, we conclude that f+ is 
integrable. Oo 


Example 10.15 Given a function f : [a,b] — R, it may well happen that |f| is 
integrable but f is not. The classical example is furnished by f : [0, 1] + R such that 


—l, if x 
Ce ae 
1, ifxeQ 

An argument analogous to that of Example 10.3 guarantees that f is not integrable. 
On the other hand, [f| is constantly equal to 1, hence integrable. 


The first and last inequalities in (10.29) make it clear the validity of the 
triangle inequality for integrals: the first inequality guarantees that i : f (x)dx can 
be computed as the difference between the areas of the regions + and R_ of the 
cartesian plane, where 74 is under the graph of f and above the horizontal axis, 
while R_ is above the graph of f and below the horizontal axis; on the other hand, 
the last equality gives p If (x)|dx as the sum of the areas of R+ and R_. 

Our next result guarantees that the integral is also additive with respect to the 
domains [a, c] and [c, b] of two integrable functions. By heuristically reasoning with 
a nonnegative function f : [a,b] — R whose restrictions to the intervals [a, c] and 
[c, b] are both integrable, we conclude that this is quite a plausible result. Indeed, 
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letting R, Ryja,j and Rj, denote the regions of the cartesian plane respectively 
under the graphs of f and of its restrictions to [a, c] and [c, b], we have R = Ryja,q U 
Rip.cj» such that Ryj_,q and 7e\p,-] have no interior points in common; therefore, one 
expects that A(R) = A(Ryja,j) + A(Ryp,q)- 


Proposition 10.16 Let a function f : [a,b] — Rand a real number c € (a,b) be 
given. If the restrictions of f to the intervals |a, c] and [c, b] are both integrable, then 
f is also integrable, with 


i : f(x)dx = / : fQddx + / , F(xddx. 


Proof Let fifa, and fijcy) denote the restrictions of f to the intervals [a, c] and [c, b], 
respectively. Given € > 0, the integrabilities of f\jq,-] and fic») assure, by means of 
the Cauchy criterion, the existence of a partition P of [a, c] and Q of [c, b] such that 


€ € 
SGitacsP) — sfita.qiP) < 5 and Sica: Q) — sGfiies: Q) < 5- 
If R = PU Q, then R is a partition of [a, b] and it is immediate that 


SOR) = Sfifaqi P) + SGijeojs Q) and s(f;R) = sGfijacyi P) + 5(fi[c0): Q)- 


Hence, 


Sf; R) — sf R) = Stags P) — 8Cftaqi P) + SGfifeo}3 Q) — 8(fifc013 Q) < €, 


so that, by invoking Cauchy’s criterion once more, we deduce the integrability of f. 
For what is left to do, let A = ce fxddx— f° f(xdx— fe f(x)dx. In the notations 
of the above discussion, we have 


Cc b 
A <S(f:R)- i f@de- / Fond 


Cc b 
= S(Fta: P) + Stes; Q) — ii Fddx — / Fd 


< SfitaqiP) + SCite.o3 Q) — sGitaqiP) — sCifcn13 Q) 
= Sita: P) — sGfitaas P) + SGitco1; Q) — s(fifeos Q) < € 


analogously, 


Cc b 
A > s(f:R)- / eye / Fla)dx 


= s(fifaqi P) + 5(fifco}3 Q) — Sita: P) — SCitco3 Q) 
= —(S(fifaqi P) — 5Gifa.cq3 P)) — (SGteo)3 Q) — $Gfifes}; Q)) > —e. 


Therefore, |A| < €. Finally, since « > 0 was chosen arbitrarily, we conclude that 
A=0. Oo 
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The previous proposition shall let us present yet another class of examples of 
integrable functions. However, before we can do so, we need the following auxiliary 
result. 


Lemma 10.17 [ff,g : [a,b] — R are such that f is integrable and f = g in (a,b), 
then g is integrable and SL? f@dx a p g(x)dx. 


Proof Let’s consider the case in which f = g in [a, b), leaving the general case to 
the reader (cf. Problem 8). 


If P= {a =X <x, <-+++ <x, = Db} is a partition of [a, b], the coincidence of f 
and g in [xo, x,_1] furnishes 


by) By ay 


Is(f: P) — s(g: P)| as inf f— int g)()—x-0)| 
j=l 


| int fit 8) Gx —xi-)| 


7 Xk—1 Xk] [xe—1 xx] 
< |F() — g(b)| © — x4-1); 


analogously, 


IS(f; P) — S(g; P)| < |f(b) — gb) |b — x-1). 


It follows from the computations above and the triangle inequality that 


S(g; P) — s(g; P) = (S(g; P) — Sf; P)) + (S(f; P) — sf; P)) 
+ (s(f; P) — s(g; P)) 
S |S(g; P) — S(f; P)| + (S(F; P) — s(f; P)) 
+ |s(f; P) — s(g; P)| 
< 2|f(b) — g(b)|(b — xe-1) + (SFP) — sf; P)). 
Now, given € > 0, the integrability of f guarantees, by means of the Cauchy 
criterion, the existence of a partition P such that S(f; P) — s(f; P) < 5 and (refining 


P, if necessary) b—x,-1 < With such a partition P, the computations 


€ 
; 4([f(b)—g(b)|+ 1) ° 
above give 


S(g; P) — s(g; P) < 2|f(b) — g(b)|(6 — x-1) + (SF; P) — sf; P)) 


< 2\f(b) — g(b)| - = 


€ 
f®)—s®l+h)2~° 


Therefore, by resorting to the Cauchy criterion once more, we conclude that g is 
integrable. 
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For the equality of the integrals of f and g, note that (again by the computations 
above) 


b 
i flx)dx < S(f:P) = (S(f:P) — S(e: P)) + S(¢:P) 


< |S(f; P) — S(g; P)| + S(g; P) 
< |f(b) — g(b)|(b — xe-1) + S(g; P). 


Hence, 


b b b 
/ fd — / a(a)dx < [f(b) — g()|(b— xe-1) + Sle: P) — / elx)dx. 


Choosing the partition P in such a way that b — x4%., < and 


20F)—s®I+ 
S(g; P) - ip P g(x)dx < 5 (this last choice being possible thanks to the Cauchy 
criterion), we get 


b b b 
i fQdx — / g(a)dx < |f(6) — 90) — 4-1) + S(g; P) — i elx)dx 


€ 


€ 
<VO)-8Ol sp —ewl4 Dy 12 


<€. 


However, since € > 0 was chosen arbitrarily, it follows from the above that 


/ : fQddx — i , g(x)dx < 0. 


a 


Finally, changing the roles of f and g in the reasoning above (which is perfectly 
valid, for we have already established the integrability of g), we get the opposite 
inequality between the integrals, thus showing that they are equal. Oo 


We now need the coming 


Definition 10.18 A function f : [a,b] — R is piecewise continuous if there exist 
real numbers a = x9 < x; < --- < x, = b such that f is continuous in the 
interval (xj;-1,x;), for 1 < j < k, and the lateral limits lim,.,4 f(x), limy+— f(x) 
and lim, + f(x) exist, for 1 <j <k. 


Figure 10.4 sketches the graph of a piecewise continuous function / : [a,b] > R, 
which is discontinuous in exactly three points. 


Proposition 10.19 Jf f : [a,b] — R is piecewise continuous, then f is inte- 
grable, with 


b k Xj 
i f@da=>" / f(x)dx. (10.30) 
a j=l * 49-1 
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Fig. 10.4 A piecewise 
continuous function 
f:lab] —>R 


Proof Vf f; : [xj-1.xj] — R is such that ff = f in (%-1,xj) and fiji) = 
tim, +f), fj) = lim,+.,—f(x), then f; is continuous, thus integrable. 
Now, Lemma 10.17 assures that the restriction of f to the interval [x;-;,x;] is 
also integrable, with . S@jdx = f a _fi(@x)dx. Finally, by repeatedly applying 
Proposition 10.16, we conclude that f is integrable in [a, b] and that (10.30) is valid. 

oO 


For the coming example, the reader may find it useful to review the concept of 
characteristic function of a set, in Definition 10.11. 


Example 10.20 Let a = xo < x1 < +++ <x, = b bea partition of the interval [a, b], 
and f : [a,b] — R be defined by 


k 


f(x) = e Cj U1 .xj]> 


j=l 


with c; € R for 1 < j < k. Since f is clearly piecewise continuous, the previous 
proposition guarantees its integrability, with 


[ rea y [° foa: 


On the other hand, Problem 2, page 357, gives 
ay ay 
i. f(xjdx = i cjdx = cj(xj — xj-1). 
xj xj 
Therefore, by combining both inequalities above, we get 


k 


b 
/ f(xax = + a; —Xj-1). 


j=l 
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Our next result establishes the converse of Proposition 10.16 and will be of 
crucial importance for the actual computation of integrals, in the next section. 


Proposition 10.21 Let f : [a,b] — R be integrable and a < c < b. Then, the 
restrictions of f to the intervals [a,c] and [c,b] (which, whenever no danger of 
confusion appears, will be denoted simply by f ) are also integrable, with 


b c b 
i f(xdx = / f(x)dx + / F(x)dx. (10.31) 


Proof It suffices to prove the integrability of the restrictions of f to [a,c] and [c, 5]. 
Indeed, once we have done this, (10.31) will follow from Proposition 10.16. 

For what is left to do, given € > 0, Cauchy’s criterion assures the existence of a 
partition R of [a, b] such that S(f; R) — s(f;R) < €. Letting R’ = R U {ct}, it follows 
from Lemma 10.1 that S(f; R’) — s(f; R’) < S(f; R) — s(f; R) < €; therefore, we can 
suppose from the very beginning that c € R. 

Let f\ja,-] denote the restriction of f to the interval [a,c]. If R = {a = x9 < x < 
tee xp =O << xy, = Dh and P= {a = x < x1 <-++ < x; =c}, then Pisa 
partition of [a,c], such that 


l 


Sfifaqi P) — SGijagqiP) = YM) — my(f)) qj — x-1) 


j=l 


k 
< DOM (f) = milf) — ¥4-1) 


j=l 


= S(f;R) — s(f;R) <. 


Hence, again by Cauchy’s criterion, fija,<j 1s integrable. 
Analogously, the restriction of f to the interval [c, b] is integrable. Oo 


The last result of this section shows that the product of two integrable functions 
is also integrable. Nevertheless, as Problem 10 shows, the value of the integral of 
the product of the functions cannot be computed from the values of the integrals of 
the factors. (However, see Problem 12.) 


Proposition 10.22 /ff, g : [a,b] > R are integrable functions, then fg : [a,b] > R 
is also integrable. 


Proof Suppose we have established the integrability of the square of an integrable 
function. Then, (f + g)’, f? and g? will be integrable, so that, since fg = s((f + 
g)? —f? — g’), several applications of Proposition 10.13, together with the remark 
right after it, guarantee the integrability of fg. 

Let’s then show that f? is integrable. To this end, note first that, since f is 
bounded, there exists a real constant c such that f + c > 0 in [a,b]. Looking at 
c as aconstant function in [a, b] and writing f? = (f + c)? — 2cf — c?, we conclude 
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(quite similarly as in the above reasoning) that f? is integrable if (f + c)? has this 
property. It then suffices to show that, if f is integrable and nonnegative, then f? is 
also integrable. 

For what is left to do, assume f to be integrable and nonnegative, so that f? is 
surely bounded. Now, given a partition P = {a = x) < x <-+++ <x, = b} of [a, D], 
it follows from Problem 14, page 207, that 


sup f? =( sup f)> =M; 


[yj-1¥5] [yj—1 4] 


and, analogously, infix, x] f 2 — m;. Hence, 


k 
S(f?; P) — s(f?;P) = SMF — m7) (j — 4-1) 


j=l 


k 
= SOM; + mj) (Mj — maj — x;-1) 
j=l 


k 
< 2supf- ))(Mj — mj) (xj — 4-1) 


[a,b] j=l 


= gree (S(f; P) — s(f; P)). 


Let € > 0 be given. By invoking Cauchy’s criterion again, and thanks to the 
integrability of f, we can suppose that S(f; P) — s(f; P) < en, at Therefore, 
the above computations give 


€ 
S(f?; P) — s(f?; P) < 2(supf) - —————.——- « «. 
fab) 2(SUPta pf) + 1 
Hence, once more from Cauchy’s criterion, f? is integrable. Oo 


Problems: Section 10.4 


1. Give an example of a differentiable function f : [a,b] — R such that f’ is not 
bounded (hence, not integrable). 

2. Let f : [a,b] — R be continuous, with a finite number of zeros in the interval 
[a, b], say x1 < x2 < +++ < xy. If 


RF ={(xy, y)e€ R?:a<x<b and 0< y <f(x)} 
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and 
= {(x,y)€ R’?;a<x<b and f(x) <y<0}, 


prove that 


b 
/ f(x)dx = A(R) — A(R; ). 


For the next problem, given continuous functions f, g : [a, b] > R, such that 
f(x) < g(x) for every x € [a, b], let 


Rye = {(x,y) € R?; a <x < b and f(x) < y < g@)} 


be the portion of the cartesian plane situated between the graphs of f and g. Let 
the area of Ry be defined by 


b 
A(Rj) = / (e(x) —f))dx. 


. Prove Cavalieri’s principle: for i = 1, 2, let fi, gi : [a,b] — R be continuous 


functions such that f;(x) < gi(x) for every x € [a, b]. If, for every x € [a, b], the 
length of the line segment joining (x, fi (x)) to (x, g1(x)) equals that of the line 
segment joining (x, fo(x)) to (x, g2(x)), then A(Ry,2,) = A(Rpe,)- 


. Let |-| : R — R be the integer part function (cf. Problem 9, page 152). Given 


n € N, compute [|x| dx. 


. Let {-} : R — R be the fractional part function (cf. Problem 10, page 152). 


Given n €N, do the following items: 


(a) Show that f"""'{xldx = fy {x}de. 
(b) Compute fj {x}dx. 


. *Letf,g: [a, ‘dle — R be continuous functions such that f(x) < g(x) for every 


x € [a, bj. ff? f(x)dx = ii g(x)dx, prove that f = g. 


. Let f : [a,b] — R be an integrable function. Assuming that |f| is also 


integrable, use the fact that —|f(x)| < f(x) < [f(x)| for every x € [a,b] to 
deduce (10.28). 


. * Complete the proof of Lemma 10.17, by examining the case in which f = g 


in (a,b). 


. Given 0 < a < bandn€ N, compute f- > "dx. Then compute ih f(x)dx, where 


f= we 9 Gx’, with ap, a),...,dn € Randa, # 0. 


. Give ecaraples of continuous and nonnegative functions f, g; [a,b] > R, such 


that [” f(x)dx > O and [? g(x)dx > 0 but f? f(x)g(x)dx = 0. 


. *Letf : [a,b] — R be an integrable function. For 0 < « < £4, show that 


lim wt (x)dx = [ Ft (x)dx. 


e>0 
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12. * Prove Cauchy’s inequality for integrals: if f, g : [a,b] — R are integrable 
functions, then 


| “poses < ( / “esyeax) "( / *gads) 


Moreover, if f and g are continuous, show that equality happens if and only if 
there exists A € R such that f(x) = A g(x) for every x € [a, b]. 
13. *Iff : [a,b] — R is a continuous and convex function, prove that 


fat+fo). 1 f° 
a eS / f@dx. 
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The main purpose of this section is to turn (10.11) into a theorem. Along all that 
follows, unless explicitly stated otherwise, we shall let J be an interval of the real 
line and f : J — R be integrable in each interval [a, b] C I. 

For c € J and [a, b] C I, we set 


l “fled = 0 and i “Pods = - / “fodder 


With these conventions at hand, and applying the second part of Proposition 10.21, 
it is immediate to verify that 


b c b 
i f(xdx = i f(x)dx + / f (x)dx, (10.32) 
for every a,b,c € I (i.e., not only when a < band c € (a,b)). 


This being said, the following definition will be of paramount importance for 
what is to come. 


Definition 10.23 Let J C R be an interval and f : J — R be integrable in each 
interval [a,b] C I. Fora fixed c ¢€ J, the indefinite integral of f based at c is the 
function F : J > R, given by 

F(x) = / “f()dt. 


If f, F : 1 > R are as in the previous definition, (10.32) promptly gives 


b a b 
F(b) — F(a) = / f()dt — / f(t)dt = / f (pdt, (10.33) 


foralla,b eT. 
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We can now state and prove the major result of this section, which is known in 
mathematical literature as the fundamental theorem of Calculus (FTC). 


Theorem 10.24 (FTC) LetI C R be an interval, let f : I > R be integrable in 
each interval [a,b] C I and F : I > R be the indefinite integral of f based at c € I. 
If f is continuous at xo € I, then F is differentiable at xo, with F' (xo) = f (xo). 


Proof For x € I \ {xo} and successively applying (10.33), Problem 2, page 357, the 
additivity of the integral and the triangle inequality for integrals, we get 


F(x) — F(x) 


X— Xo 


— f(xo)| 


i) 


ny, 0 — Foo)lat). 


ey 


(In the last inequality above, the modulus outside the integral is due to the fact that, 


if x < xp, then {* [f() —f(xo)|dt = — f°" |f —f@w)ldt. 


The continuity of f at x9 guarantees that, given e > 0, there exists 6 > 0 such that 
tel, |t—xo| <6 > |fO —f@o)| <«. 

Hence, for 0 < |x — x9| < 6, we have |t — x9| < 6 for every t belonging to the 

interval with endpoints xo and x, so that [f(t) — f(x0)| < e. It thus follows from 


the monotonicity of the integral, and once more from the result of Problem 2, page 
357, that 


if Ifo) — f(%o)|at| = | eat = e|x— xo. 


Finally, the above computations assure that, for x € J such that 0 < |x —x9| < 6, 
we have 


F(x) — F(x0) 
|" — F60)| < —- ex =e. 
x— Xo |x — Xo| 
Hence, F is differentiable at xo, with F’(xo) = f (x0). | 


The FTC is the ingredient that was missing for us to get an easy way of computing 
integrals of various elementary functions. However, before we can actually do that, 
we need one more piece of terminology. 


Definition 10.25 Let 7 C R be an interval and f : J — R be integrable in each 
interval [a, b] c I. A primitive for f in / is a differentiable function F : J > R such 
that F’ =f in J. 
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In terms of the previous definition, the FTC guarantees that each indefinite 
integral of a continuous function f : J > R is a primitive of f in /. Indeed, letting 
F : I — R be the indefinite integral of F based at c € J, it follows from the FTC 
and the continuity of f in J that F’(x) = f(x) for every x € I. Yet in another way, we 
have that: 


f continuous > & [fod = f(x). (10.34) 


The coming result assures that, even for functions which are merely integrable, 
there are no other possible primitives. In other words, it guarantees that, even if f is 
just an integrable function, there are no other possible solutions for (10.37). 


Theorem 10.26 Let J C R be an interval and f : I —> R be integrable in each 
interval [a,b] C I. If F : I > R is a primitive for f, then, for a given xp € I, we 
have 


F(x) = F(xo) + / "f(t (10.35) 


for everyx € 1. 


Proof Suppose x > xo (the case x = Xo is trivial and the case x < xo can be dealt 
with in an analogous way, taking into account that f@dt =—-f “0 f(t)dt). 

Let P = {x9 < x1 < +++ < x, = x} be a partition of [xo,x]. Since F is 
differentiable in /, it is continuous in each interval [a,b] C J. Hence, Lagrange’s 
MVT gives & € (xj-1, xj) such that 


F(x) — FQj-1) = F'(&) Gj — 4-1) = FE) Oj — 441), 


for 1 <j <k. Then, we get 


k 
F(x) — Fo) = > (FG) — F@+1)) 


j=l 


k 
=) fG)@ —x1) 


j=l 
= X(f; P;&). 


Now recall that, according to Riemann’s Theorem 10.7, 


i “f(Odt = lim S(f:P:8), 
xo |P|>0 
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for every choice § = (&;) of intermediate points relative to P. Therefore, in view 
of the computations in the previous paragraph, there is no other choice than to 
have (10.35). oO 


If f : J — R has a primitive, then, thanks to the previous result, one uses to 
denote a generic primitive of f by writing [ f(t)dt. The reader must be careful not to 
making confusion with the notations [ f(t)dt and ie f (t)dt: while the first one refers 
to a differentiable function from I to R whose derivative coincides with f in J, the 
second denotes a real number. Also, note that the theorem guarantees that if { f(t)dt 
is a primitive of f in /, then the primitives of f in J are the functions of the form 


[roa C, 


where C is a real constant. 

With Theorem 10.26 at our disposal, the coming corollary assures that, if f : 
[a, b] > R is an integrable function, then, in order to effectively compute J’ ae (t)dt, 
it suffices to find a primitive for f. From now on, given a continuous function F : 
[a, b] > R, we denote 


Fo] = F(b) — F(a). 


Corollary 10.27 [ff : [a,b] > R is an integrable function and F : [a, b| > Risa 
primitive for f, then 


b x=b 
/ f()dt = F(x)| _- (10.36) 


Proof Let xp = a and x = bin (10.35). oO 


As we anticipated before, the former corollary provides us with a general strategy 
for computing the actual value of thes (t)dt: it suffices to get to visualize the 
integrand function f as the derivative of some function F, then applying (10.36) 
to obtain Sf? F(bat = F(b) — F(a). That is why we usually look at (10.36) as 


b x=b 
/ F' (dt = Fo)| 


Let us see a couple of examples. 


Example 10.28 Let f : R > R be such that f(x) = 7) qx’ for every x € R, 

where do, 4j, ..., Gy are given real numbers, with a, # 0. Since F(x) = 
=) ae is clearly a primitive of f, given reals a < b we have 

i “float = Fo) = > wt ait) 

, a GFF 


0 
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Example 10.29 Compute fy" sinxdx, [5° sin? xdx and f," cos? x dx. 
Solution Since —cos is a primitive for sin, the previous corollary gives 


x= 
= —cosz + cos0 = 2. 


x=0 


Je 
i} sinx dx = —cosx 
0 


With respect to the second integral, since sin? x = (1 — cos 2x) and o sin 2x = 
2 cos 2x, we have (again by that result) 


™ 1 f” 1 1 
/ sintxds = > [ (1 cos 2nd = 5 (x 5 sin) 


Finally, since sin? x + cos? x = 1, we have 


/ cos’ xdx = / (1 — sin*x)dx = ; Idx — / sin?x dx 
0) 0 0 0 


x=n ae 4 1 I 
_ sinnxdx =n - = —. 
0 2 2; 


x= 


mA 
x=0 2 ; 


oO 


Another way of rephrasing (10.34), which is quite important for the theory of 
ordinary differential equations’ is collected in the following result. 


Proposition 10.30 Let I C R be an interval and f : I > R be a continuous 
function. Given xo € I and yo € R, the initial value problem 


y =f) (10.37) 
y(xo) = yo 


has as its only solution the function F : I > R given by 
F(x) = yo + / f(odt. (10.38) 
x0 


Proof A solution of (10.37) is a differentiable function F : J + R such that F’(x) = 
J (x) for every x € J and F(x) = yo. Since f is continuous, (10.34) guarantees that 
F, defined as in (10.38), is a solution of (10.37). 


3In spite of being quite an interesting and important subject, we shall not have occasion to give 
a systematic development of the theory of ODE’s in these notes. For the interested reader, we 
recommend the outstanding book of professor George Simmons, [23]. 
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On the other hand, if F), F2 : 1 > R solve (10.37), then F; = f = F. Therefore, 
Corollary 9.47 guarantees that F'; — F’2 is constant. However, since (F; — F2)(xo) = 
yo — yo = 0, we conclude that F; — Fp = 0, i.e., Fj = F. oO 


Most often, the task of explicitly finding a primitive of a given integrable 
function (if possible) is not immediate. That’s why Calculus courses generally spend 
considerable time in developing specific techniques for the computation of integrals, 
which are generically referred to as integration techniques. We discuss the most 
relevant parts of this toolkit in the rest of this section and along the next one, starting 
with the formula for integration by parts. 


Proposition 10.31 /ff,g : [a,b] — R are differentiable functions with integrable 
derivatives, then 


b x=b b 
[ researc =senaco| = f rere'coae. (10.39) 


Proof Firstly, note that the integrals in both sides of (10.39) are well defined. Indeed, 
f and g, being differentiable, are continuous; on the other hand, since f” and g’ are 
integrable, Proposition 10.22 guarantees that f’g and fg’ are also integrable. 

Now, since (fg)’ = f’g + fg’, Corollary 10.27 gives 


b b x=b 
(ode) +FE)e'@)dr = | (f' Gar = Foe] 


Finally, in order to obtain (10.39), it suffices to apply the additivity of the integral to 
the left hand side of the equality above. Oo 


Example 10.32 Compute [> x sinxdx and fy x? cos x dx. 


Solution For the first integral, letting f(x) = —cosx and g(x) = x in (10.39), we 
get 


au au x= 8 
i x sin x dx =| x(— cos’ x)dx = —xeos.| -{ (— cos x)dx 
0 0 x=0 0 
1s 


x= 
=T7. 
x= 


bs 
w+ / cosxdx = a + sinx 
0 


For the second, letting f(x) = sinx and g(x) = x? in (10.39), we obtain 


x= a . 
- 2x sin x dx 
0 


x=0 


8 rs 
/ x cosxdx = / x? sin’ xdx = x? sinx 
0 0 


= -2f xsinxdx = —27, 
0 


where we used the result of the first part in the last equality above. Oo 
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Note that, in terms of primitives, the integration by parts formula gives 


/ Fede = faxes) — / Foe’ (dx. 


In words, a primitive for f’g can be obtained by subtracting a primitive for fg’ 
from fg. 
The following particular case of integration by parts is frequently useful. 


Corollary 10.33 Iff : [a,b] — R is differentiable and f’ is integrable, then 


b ; b 
/ flax = af) - / xf’ (dx. (10.40) 


Proof Changing the roles of f and g in (10.39), we get 


b ‘ b 
[ fovea =fere0o|, - [fr erecoar. 


Now, it suffices to let g(x) = x for every x € [a, b]. Oo 


Remark 10.34 The derivative of a differentiable function may not be integrable. For 
an example, see Problem |, page 379. 


Our coming example will find a prominent role in the proof of Theorem 10.56, 
in Sect. 10.7. 


Example 10.35 Letn € Z4 and I, = {"” 


9 (cosx)"dx. Show that: 


(a) nl, = (n— 1)I,—2 for every integer n > 2. 


— 2b! 2 : — (2kKN? : 
(b) Ly = Orn? * 2 for every integer k > 0, and Iy4-) = ROK) for every integer 


k>1. 
Proof (a) Forn = 2, integration by parts gives 
m/2 n/2 
i; = / (cos x)""! cosxdx = / (cosx)""! sin’ x dx 
0 0 
x/2 m/2 
= (cosx)” ! sin | -{ (n — 1)(cosx)" *(— sinx) sinx dx 

0 0 


m/2 
= (n— »f (cos .x)"-7(1 — cos? x)dx 
0 


m/2 n/2 
= (n— »[ (cos x)""7dx — (n— »f cos” x dx 
0 0 
= (n _ 1)In-2 _ (n ~ In. 


Hence, nl, = (n — 1)I,-2. 
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x= 

(b) It’s immediate that Ip) = is "dx = xf °° = 5. By induction hypothesis, 
x=0 

suppose that Io, = ae - 5 for some integer m > 0. Letting n = 2m + 2 in 


the recurrence relation of item (a), we get 


2m+ 1 2m+1 (2m)! az 
m+ 2 ~" 
(Qn+2)(2m+1) (Qm)! az 
~Q(m+))2 | "ml? 2 
(2m + 2)! sa 
= Gr DOE 


Lom42 = 


Thus, the first part of (c) is valid for every k => 0. 
Finally, the proof of the formula for J),_; is completely analogous, and will 
be left as an exercise to the reader. 
oO 


Let’s finish this section by using the FTC to present a proof of the irrationality 
of z. Our discussion somewhat follows part B.17 of another marvelous book of 
professor G. Simmons [22], and (as quoted there) is an elaboration of ideas of 
C. Hermite by I. Niven. We remark, however, that the irrationality of z was first 
established by the Swiss mathematician of the XVIII century J. H. Lambert. 


Theorem 10.36 (Lambert) — z is irrational. 


Proof If x were rational, say 7 = me with p,q € N, then we would have n= 
with p?, rh e€ N. Thus, suppose, for the sake of contradiction, that n= ae with 
a,beN. 

Let’s start by taking a twice continuously differentiable function g : [0,1] > R, 


so that 


Po 
@’ 


“(e') sin(x) — g(x) cos(x)) = (¢"(x) + 2 g(x) sin(rx). 


The FTC gives 
m(g(1) + g(0)) = (g(a) sin(erx) — srg(x) costera))| 
1 
= / (¢" (x) + 279(x)) sin(srx)dx (10.41) 


~ i ‘(e"(@) + 27g(x)) sin(sx)dx. 
0 
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Loosely speaking, Lambert’s ingenious idea was to use the fact that x? is rational 
to try to choose g in such a way that, on the one hand, g(0), g(1) € Z, while, on the 
other, 


1 
| i. (¢"(x) + 7g(x)) sin(rx)dx} < 1. 
0 


In light of (10.41), this would clearly give us an absurd. 
Actually, departing from a continuous function f and using the assumption that 


= 5 set 
g(x) = b" (02"f (x) = Callies i (x) +4 204 (4) (x) Si 
(10.42) 
= a"f (x) — ba" 1 (x) + Ba" 2fO HW —---, 
where n EN. 


The second equality above assures that 
f°) FOO) €Z, V je Z+ = g(0), g(l) €Z. 
On the other hand, a straightforward computation using the first equality 


in (10.42) gives 


ea) + g(a) = bf) = KF 


Hence, to reach the desired contradiction, f must be also such that 


2 2] (10.43) 
; 


[re sin(sx)dx 
0 


Since (from the triangle inequality for integrals) 


1 1 1 
| F(x) sin(arx)dx </ If (x) sin(srx)|dx < 7 If (x) |dx 
0 0 0 


but we have no hints on the size of a, in order to get (10.43) it is natural to choose f 
in such a way that 


Cc 
max{|f(x)|;0<x<I}<—, 
n! 
for some positive constant C. Once we have done that, we will have 


a" 
<—_, 
bn! 


y, f(x) sin(svx)dx 
0 


a 1 
b 


which, for a given a > 0, is less that 1 for a sufficiently large n. 
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We now have enough clues to search for f, and a moment’s thought shows 


1 
(Qe et 


to be a strong candidate. Indeed, for such an f we obviously have |f(x)| < 1 for 
0 < x < 1, so that it suffices to check that f” (0), f (1) € Z for every j € Z+. 

What is left to do is this is relatively easy. First of all, it follows from the binomial 
formula that 


2n 


1 
f(x) = Fae, 


* k=n 


with a, € Z forn < k < 2n. Therefore, if 0 < j < n or j > 2n, we obviously have 
f(0) = 0 and f (1) = 0. On the other hand, if n < j < 2n, then 


2n 
f9@ = a So k(k= 1)... (k= f+ Daw 
me 


_ J! k! kj 
7 ze nl (k—pyl 


2n 
= Ly ...t04 {out 
k=j J 


so that f (0) and f (1) also belong to Z. Oo 


Problems: Section 10.5 


1. In each of the items below, compute the given integral: 


(a) id —cost)*dt. 
(b) fo" V1 —costadt. 
(c) i x sin(zx)dx. 
2. In the two items below, compute the primitives: 


(a) [(sinx)? cosx dx. 
(b) fx? sinx dx. 


3. Given m,n € N, show that 


0, ifmAn 


x, ifm=n- 


2n 2 
/ sin(mx) sin(nx)dx = / cos(mx) cos(nx)dx = 
0 0 
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4. 


11. 


12. 


13. 


In each of the following items, solve the indicated initial value problem in the 
largest possible interval J Cc R: 


y= Jx " = sec? x 
" oer _ | 


. * Let g,h : [c,d] — R be differentiable functions and f : [a,b] — R be 


continuous. Show that 


d h(x) 
oF [ f(d)dt = f(hx))h' (x) — f(g) g(x). 
g(x 


. * Let f : R > R be a continuous and periodic function, with period p > 0. 


Prove that /' ie f(pdt = ff? f(t)dt for every a € R. 


. Let f : [a,b] — R be the restriction of an affine function to the interval [a, b]. 


If f is nonnegative, the region Ry is either a right triangle or a right trapezoid. 
Show that, in each of these cases, the value for the area of Ry obtained with 
the aid of the FTC coincides with the one obtained by means of the ordinary 
formulas of Euclidean Geometry for the areas of these polygons (cf. [4], for 
instance). 


. Let f : (a,b) — R be continuous, nonnegative and increasing (resp. 


decreasing). Prove that every primitive of f is strictly convex (resp. strictly 


concave). 
n 


. Given n € N, compute 77-9 TEED (”) in terms of n. 
. Prove the mean value theorem for integrals: given a continuous function f : 


[a, b] > R, there exists c € [a, b] such that 


1 b 
— 7G: 


Let f : [a,b] — R be a continuously differentiable function, with f(a) = 
f(b) = 0. Prove that 


([ sera)’ < ( [ erera)(['reora), 


(Romania) Let F be the set of continuous functions f : [0, 7] — R such that 
/ f (x) sinxdx = / f(x) cos xdx = 1. 
0 0 


Compute infpex [> f(x)" dx. 


*Letn € Z+ and I, = is *(cos x)"dx. The purpose of this problem is to show 


Lys 


that lim, +400 7 


= |. To this end, do the following items: 


392 10 Riemann’s Integral 


(a) Show that J,41 < [, for every integer n > 0. 
(b) Use item (a) of Example 10.35 to conclude that each of the sequences 


(B=) and ( tox ) is nondecreasing. 
k>1 k>1 


Typ-2 Dei 


(c) Conclude that both sequences of the previous item converge to limits Co 
and €,, say, such that £9, 2; > 0. 


-1 
(d) Show that aan = (4) . Then, let n = 2k > +00 to get fof; = 1. 


n+1 \In-1 


(e) Conclude that £)9 = €,; = 1 and, then, that lim, +40 Intl — 1, 
+ 1, 


10.6 The Change of Variables Formula 


Continuing with the development of the theory, we shall now discuss the change 
of variables formula. As the subsequent discussion will make it evident, such a 
formula will provide another useful integration technique. 


Theorem 10.37 Let f : [a,b] — R be continuous. If g : [c,d| — [a,b] is 
differentiable and g' is integrable, then 


gd 


) d 
fat = i F(e(s))2"(s)ds. (10.44) 


g(c) 


Proof Yn order to compute the integral at the left hand side, we start by taking a 
primitive F : [a,b] > R for, whose existence follows from the FTC. Then, thanks 
to Corollary 10.27, we get 


g(d) g(d) 70d) 
fdr = / F'(pde = F(o| 
8(c) g(c) x=g(c) 


s=d 
= F(g(d)) — F(g(c)) = (Fo (| 


Now, since F and g are differentiable, the chain rule guarantees that the function 
Fog: [c,d] — R is also differentiable, with 


(Fo g)'(s) = F’(g(s))g'(s) = F(g(s))8’(s) = (F © 8)(s)8"(s). 


On the other hand, since f and g are continuous, we have f o g continuous, thus, 
integrable; however, since g’ is integrable, it follows from Proposition 10.22 that 
(f o g)g’ integrable. Hence, applying Corollary 10.27 once more, we have 


s= 


. d d 
(Foo = | Foey'odr= [feos as 


s= 
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Remark 10.38 For a variant of the previous result, applicable to the case of a 
piecewise continuous f, see Problem 6. 


The corollary below isolates a simple, yet quite useful, particular case of the 
change of variables formula. 


Corollary 10.39 Iff : [a,b] — R is continuous and i € R*, then 


Ab 


b 
i f(As) ds = z f(t) dt. (10.45) 
a A ha 


Proof For 4 > 0 it suffices to let g : [a,b] — [Aa, Ab] be given by g(s) = As in 
Theorem 10.37. Indeed, with such a choice we get 


Ab e(b) b b 
| floyd = |  oat= " Fle(s))e'(s) ds = i FlAsyAds. 


For A < 0, let g : [a,b] — [Ab, Aa] be given by g(s) = As. Then, arguing as in 


the previous case and recalling that f° = — Lb and f;, =—f; Aa we obtain 
Ab Aa g(a) 
fOd=-|f f@a=- f(t) at 
a Ab g(b) 


a b 
— | Fle(s))g"(s) ds = / fAsyds. 
b a 


oO 


We shall sometimes refer to (10.44) as the formula for integration by substitu- 
tion. In what follows, we give some heuristics on such an expression. 

Given a continuous function f : [a,b] > R, in order to compute f” i (t)dt with 
the aid of the FTC we need to obtain a primitive for F’. In this sense, and after a 
careful examination of the expression of f(t), we may eventually notice that it will 
be quite simplified if we perform an adequate substitution of variables, changing t 
by an expression g(s), depending on a new variable s. 

However, since f is defined in the interval [a, b], for such a substitution to make 
sense it is necessary that g(s) belongs to the interval [a, b] when s varies in some 
interval [c, d]. In general, the best way of identifying such an interval [c, d] is solving 
inequalities a < g(s) < b, thus finding the appropriate values of c and d. Then, by 
formally differentiating the equality t = g(s), we get dt = g'(s) ds, so that the 
original integral must be corrected by the factor g’(s) before it can be transformed 
into a new integral. 

In short, the execution of both passages described below give us the formula for 
integration by substitution in any particular case we encounter: 


substitting solving 


i f(t) dt OL [ fle(O)¢ @ ds OS ia Fle(s))e'(s) ds. 
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Finally, we observe that for the last integral above to make sense, the function 
st> f(g(s))g’(s) must be integrable. Hence (as we saw in the proof of the theorem) 
it is natural to suppose that g is differentiable and g’ is integrable in [c, d]. 

Let’s give a concrete example, showing how the informal discussion of the two 
previous paragraphs simplifies the use of the change of variables formula in the 
actual computation of integrals. 


Example 10.40 Compute [OE tsin(?) dt. 


Solution Performing the substitution t = ./s, we get 


9 


[OO vsinceya [ vo — sf si d 
sin => S(SiIn Ss) —ds = — sin S ds, 
Ja 2 2.4/8 2 Jo 

1 


where the correction factor ze was obtained by formally differentiating equality 


t= 4/s, to get dt = zp ds. 


In order to correct the integration interval [./7, 2./7], we solve the inequalities 
Ja < Js < 2/7, thus getting m < s < 47. Hence, 


2/1 1 4a 1 s=4n 
i t sin(?”) dt = >/ sins ds = ~(—coss) =-l, 
ga 2x 2 


It s= 


where we used the FTC in the last equality. Oo 


Also with respect to the previous example (and in accordance to Theorem 10.37), 
note that all of the performed substitutions are valid. Indeed, what we really did was 
to use g : [2,42] > [/7,2./z] such that g(s) = /s, and such a function is 
differentiable, with g’(s) = a integrable in the interval [z, 47]. 


wa 
Example 10.41 Givenn € Z+, compute fo? (sin x)"dx. 


Solution In Example 10.35, we computed /,, = ie *(cos x)"dx. Here, we shall use 


integration by substitution to show that J, = ba 5 (sin x)"dx too. To this end, start by 
recalling that cosx = sin(> — x). Therefore, letting x = + — y (so that dx = —dy), 
we get 


m/2 ? 2 
i (sin.x)"dx = i: (sin(z/2 — y))"(—1)dy = -| (cos y)"dy. 
0 2 2 
The correct endpoints of the last integral can be easily found once we note that 


0 <4 -y < 7 if and only if 0 < y < 4. Nevertheless, one has to be careful, for, 
x=O0>y=Fandx= > > y= 0. Hence, we get 


x/2 0 n/2 
/ (sin x)"dx = -{ (cos y)"dy = i (cos y)"dy = In. 
0 m/2 0 
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From a more formal viewpoint, the last substitution above reduces to using g : 
[0, 5] — [0, 4] such that g(x) = 4 —x. Since g(0) = 4, g(4) = Oand g’(x) = —x, 
we have 


m/2 g(0) 0 
/ (sinx)"dx = / (sin x)"dx = / (sin g(x))"9" (x)dx 
0 s(F) x 


ca 
er 
xz 
2 


0 
= -{ (sin(sr/2 — x))"dx = (cos x)"dx. 
z 0 


One of the most important usages of integration by substitution is related to 
trigonometric substitution. To get to the point, suppose we have a continuous 
function f : [—1, 1] — R and wish to compute ae Ff (x)dx. Then, recalling that the 
restriction of the sine function to the interval [—4, 4] is differentiable, has integrable 
derivative and satisfies sin(—+) = —1 and sin + = 1, it follows from (10.44) (with 
g(t) = sinf) that 


1 sin 5 z 
/ f(x)dx = / f(x) dx = f (sin t) cos t dt. 
= | ¢ ) 


in(— F =3 
Depending on the algebraic expression that defines f(x) in terms of x, the last 
integral above may be much simpler to compute that the original one. 

Although the discussion on the previous paragraph has a number of useful 
variations (depending on the more adequate trigonometric function to be used), 
we shall not try to list them all. Instead, we collect two relevant examples of 
trigonometric substitutions. 


Example 10.42 Compute fy ~izdx. 


1 
TE We may use the FTC 


Proof Recalling, from Example 9.35, that a arctan x = 
to get 


1 + x2 x=0 


f 1 x=1 a 
7 dx = arctanx = arctan] = re 
0 


Alternatively, by performing the trigonometric substitution x = tan ¢ and noticing 
thatO <x< 18S 0<t< 7 we get, from the formula for integration by 
substitution and Example 9.34, 

t 


2 a 1 ; 1 5 
dx = ———— : tan’ tdt = - sec’ tdt 
o L+x? o 1+ tan?t 9 sec? t 


x 
=| ldt=t 
0 


AIS 


= A 
t=0 4 , 
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Example 10.43 Let T be the circle with center O and radius R, and choose a 
cartesian coordinate system such that O(0,0). If f : [—R,R] — R is given by 
f(x) = VR? — x’, then the region Ry under the graph of f (and above the horizontal 
axis is the half-disk bounded by IT’ and lying in the upper half-plane. Then, 


R R 
AR) =f feyde= [VR Fat, 
—R —R 
The trigonometric substitution x = Rcost, with O < t < 7, gives 
0 
A(Ry) = / VR? — R? cos? t(—R sin f)dt 


1 rt R2 
— / R’ sin? tdt = —., 
0 2 


where we used the result of Example 10.29 in the last equality. 

A similar reasoning proves that the area of the half-disk bounded by T' and lying 
in the lower half-plane also equals ae 
2: ak = mR’, as the reader surely predicted from earlier studies of Euclidean 
Geometry. 


. Hence, the area of the whole disk equals 


Remark 10.44 As was pointed out in Remark 9.12, in Chap. 11 the sine and cosine 
functions will be constructed without relying in any concepts or results of Euclidean 
Geometry. In turn, this will allow us to compute the derivatives of the sine and cosine 
functions also without reference to Euclidean Geometry. Once this has been done, 
the previous example will provide us with a genuine way of computing the area of 
a circle with radius R. 


Back to the general discussion of integration by substitution, let intervals J and 
J and functions g : J > J and f : J > R be given, such that f is continuous and 
g is continuously differentiable. It’s immediate to verify that (10.44) furnishes the 
following equality of primitives: 


( / fodt) eae / fled)’ de +C, (10.46) 


where C is a real constant. We now examine a specific relevant case. 


Example 10.45 Let I be an interval, f : I — (0,+00) be a continuously 
differentiable function and r be a nonzero rational number. Since 


d ; 
Sfey" = fF). 
X 


we see that if (x)’ is a primitive for the function x +> f(x)""!f" (x). 
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In other words, we have 
r—ly¢ 1 r 
fx)" f dx = f(x)" + C, (10.47) 
r 


an equality that can be easily remembered from (10.46), with f in place of g and 
xt» x"! in place of f: on the one hand, the FTC gives us 


fe) 1 f@) 1 
/ t dt= =| + C= —f(x)"+C; 
r r 
on the other, the variable substitution t = f(x) gives 


f(x) 
/ tf dt = / f(x)" |f' (xd. 


We conclude this section by observing that, apart from the problems collected 
below, we shall see some other relevant examples of integration by trigonometric 
substitution in the coming sections. 


Problems: Section 10.6 


1. In each of the items below, compute the given integral: 


(a) fo ae 
(0) fo? os 


(c) i Ted 


2. In each of the items below, compute the given primitives, together with their 
maximal domains of definition: 


(a) fxPV1 + x3dx. 
(b) fev 1 + x3dx. 
(c) fx2evl — x?dx. 
(d) [cos /xdx. 


3. Let f : [—a, a] — R be a continuous function. Show that: 


i _ § 2/5 F@)dx, if f is even 
ere 0. if f is odd. 


1 
4. Compute ie : ane dx and [? ed ie 


oe 
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10. 


For the coming example the reader might find it useful to review the basics on 
ellipses, in Chap. 6 of [4], for instance. 


. Prove that the area of an ellipse of principal axes of lengths 2a and 2b equals 


mab. 


. Prove the following version of Theorem 10.37: if f : [a,b] — R is piecewise 


continuous and g : [c,d] — [a,b] is increasing (resp. decreasing) and 
differentiable, with integrable derivative, then 


g(d 


) d 
fiat = J Fle(s))e' as 


g(c) 


. If f : [a,b] — [a,b] is an increasing and differentiable bijection, compute all 


possible values of F(x) + £7! (x)) dx. 
For the next problem, the reader may find it convenient to review the concept 
of reciprocal polynomial, in Problem 28, page 44. 


. Let p be a polynomial of degree n. Show that p is reciprocal if and only if 


“pO f' 24, 
1 pn/2+1 ~ x pn/2+1 2 


for every real x > 1. 


. Let f : [1, +00) > R be a differentiable function such that f(1) = 1 and f’(x) = 


Zhe for every x > 1. Prove that lim,_.+o0,(x) does exist and is at most 1+ oe 
Let J C R be an interval and g : J > I be acontinuously differentiable function 
of finite order, i.e., such that g""”) = Id; for some m € N, where g” = go...og, 
the composition of g with itself m times. If x € J and f : ] > R is a continuous 
function semi-invariant over g, in the sense that (f o g)g’ = f, compute the 


possible values of 8) ep)dt. 


10.7 Logarithms and Exponentials 


This section brings a fairly standard presentation of the main properties of two of 
the most important functions of Mathematics, namely, the natural logarithm and the 
exponential functions, postponing to the next section some interesting and important 
applications of them. 


Definition 10.46 The natural logarithm function log : (0,+00o) — R is 
defined by 


x | 
logx = / —dt. 
1 ¢ 
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Fig. 10.5 Definition of log x for x > 1 


It readily follows from the above definition that log 1 = 0, logx > Oifx > 1 
and logx < Oif 0 < x < 1. Moreover, letting f : (0,-+co) — R denote the inverse 
proportionality function, so that f(x) = + for every x > 0, then, in terms of the 
area of the region of the first quadrant situated under the graph of f (cf. Fig. 10.5), 
we have 


A (fits): ifx>1 
logx = 0, ifx=1. 
—A(fip.1]), if x < 1 


The FTC assures that log is a differentiable function, with 
‘ 1 
log x= -. 
x 
Therefore, log is infinitely differentiable and, since log’ > 0, the study of the first 
variation of functions shows that log is increasing. Also, since 


” 1 
log x= =a <0, 
x 


it follows from Corollary 9.70 that log is strictly concave. 
The coming result and their corollaries bring more important properties of the 
natural logarithm. 


Proposition 10.47 For x,y > 0, we have 


log(xy) = logx + logy. (10.48) 
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Proof For the first part, the additivity of the integral gives 


x Y x] 4 YY 
logxy = : —dt = / —dt+ / —dt = logx + i —dt. 
1 ¢ 1 ¢ yt ae: 


Now, if g : [l,y] — [x,x] is given by g(t) = xt, the change of variables 
Theorem 10.37 furnishes 


Pr 1 aly) ] y 
wat = [ war= [ Z—-9(ds= [as 
x t g(1) t 1 g(s) 
yy 
=| —-ds = logy. 
1 8S 


Corollary 10.48 Forx > Oandr € Q, we have logx" = rlogx. 


Proof Applying (10.48) with y = x we get logx* = 2logx. If we assume that 
logx* = klogx for every x > 0 and some k € N, then letting y = x* in (10.48) 
we get 


logx‘t! = log(x- x*) = logx + logx* 
= logx+ klogx = (k+ 1) logx. 


Therefore, it follows by induction that log x” = nlogx for every x > 0 and every 
neN. 


We now apply this particular case with x!/” 


in place of x to obtain 
log x = log(x!/")" = nlogx'/” 
or, which is the same, log x!/" = + log x. 


In view of the particular cases of the two previous paragraphs and letting r = 2, 
with m,n € N, we have 


1 
log x” = log x!” = log(x!/")" = mlogx'/" = m- — logx = rlogx. 
n 


In order to extend this last equality for negative rational exponents, start by letting 
y =x! in (10.48) to get 


0 = log1 = log(x- x7!) = logx + logx!, 


so that logx~! = —log.x. Therefore, if r is a negative rational and s = —r > 0, 
we get 


logx’ = log(x*)~! = —logx* = —(s- log x) = rlogx. 
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Corollary 10.49 lim,—.o+ logx = —oo and lim,-++4. log x = +00. In particular, 
Im (log) = R. 


Proof Fix a > 1. Since loga > 0, it follows from the previous corollary that 


loga” = nloga —"+ +00 and loga”" = —nloga ee ate, 
On the other hand, since 0 < 4 < 1, Example 7.12 furnishes 


n 


lim a 


= lim (1/a)"=0 and lim a”’= lim = +00. 
n—>+o0o n—>+oo n—> +00 


n>+oo q-” 


The limits above, together with the fact that log is an increasing function, 
guarantee that lim,_.94 logx = —oo and lim,++4. logx = +oo. Hence, it follows 
from Problem 7, page 290, that Im (f) = R. Oo 


We now relate the number e, defined in Example 7.41, to the natural logarithm 
function. 


Theorem 10.50 loge = 1. 


Proof First of all, note (cf. Fig. 10.6, where we sketched the portion of the graph of 
tr> + from t = ltotr=1+ +) that 


1 +5 J os r=l+y 1 1 
tog(1+-) = f wat < [ dt = =(1+2)-1=- 
n 1 t 1 t=1 n n 


and, analogously 


1 +5 | tq | n 1 1 
log{1+-—-])= —dt > 7 at= _-—= , 
n 1 t I 1+- n+1n n+1 


Hence, 


1 1 1 
<1 1+-])<-, 10.49 
n+1 oz ( = -) n ( ) 


Fig. 10.6 Estimating I 
log (1+ +) I 
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so that at < nlog (1 + +) < 1. The squeezing theorem (Problem 6, page 218) 


then gives lim,++00 n log ( 1+ +) = | or, which is the same, 


1 n 
lim toe (1 + -) =1. 
n—>+o0o n 


Now, since (according to Theorem 7.42) e = limp++4o0 (1 + 2)" and log is 
continuous, we have 


loge =log lim (1 + ~) = lim log (1 + ~) =I! 
n—-+oo n n— +00 n 
Oo 


Gathering together the information obtained so far on the natural logarithm, we 
sketch its graph on Fig. 10.7. 

The sketch above of the graph of the logarithm function suggests that it grows 
relatively slowly. The next result quantifies this suspicion. 
Theorem 10.51 For an integer n > 1, we have lim,-++00 a = 0. 


1 
ad 
ae 


: one 
Proof Firstly, observe that t > t'-3n fort > 1. Hence, 7 < for t > 1, so that, 


forx > 1, : 
x] aa | x 4 
logx = —dt < dt = t dt 
1 ¢ 1 fl 3 \ 
tm t=x 5 20fy 1 P 20/y 
a T/2n ao n(x — 1) < 2n Vx. 


Fig. 10.7 Graph of log : (0, +00) > R 
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Therefore, for x > 1 we have 


1 2 2 
0< ObF One ae di 


Now, since limy_+ +60 -— = 0, the squeezing theorem (cf. Problem 6, page 218) 


ae 


— 0 when x > +00. oO 


logx 
Be 

Since the natural logarithm function log : (0,-+oo) — R is a continuous and 
increasing bijection, we can consider its inverse 


guarantees that 


exp: R > (0, +00), (10.50) 


called the exponential function. Theorem 8.35 and Problem 8, page 176, guarantee 
that exp is also a continuous and increasing bijection. Moreover, for given x, y € R, 
with x > 0, we have 


logx = y & x = exp(); 


in particular, it follows from log! = 0 and loge = 1 that exp(0) = 1 and 
exp(1) =e. 

On the other hand, since log’ x = i # 0, Theorem 9.28 assures that exp is a 
differentiable function such that, for x > 0 and y = logx, 


1 1 
Te = = a 
exp (y) ines ve xX = exp(y). 
In short, 
exp’ = exp, 


which in particular shows that exp is infinitely differentiable. Also, since exp” = 
exp > 0, it follows from Corollary 9.70 that exp is a strictly convex function. 

The coming result translates, for the exponential function, the properties of the 
natural logarithm expressed in Proposition 10.47 and Corollary 10.48. 


Proposition 10.52 For x,y € R, we have: 


(a) exp(x + y) = exp(x) - exp(). 

(b) exp(r) =e", ifreQ 

Proof 

(a) Letting a = exp(x) and b = exp(y), we have a,b > 0 and loga = x, logb = y. 


Hence, 


x+y = loga+ logb = log(ab), 
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so that exp(x + y) = ab. The above computations also give 
exp(x) - exp(y) = ab = exp(x + y). 


(b) Since log and exp are inverses of each other and loge = 1, it follows from 
Proposition 10.47 that 


logexp(r) = r = loge’. 


Thus, the injectivity of log gives exp(r) = e’. 


Thanks to the item (b) of the former proposition, from now on we shall write 
exp(x) = e*, 


stressing that the right hand side coincides with the usual meaning of e* when x € Q, 
and defines e* when x ¢ Q. In this new notation, item (a) of the previous result is 
written as 


forallx,y € R. 

More generally, the exponential function allows us to rigorously define what 
is meant by the power a*, for positive a (and real x). Indeed, in the notation just 
introduced, we set 


at = eter (10.51) 
so that 
Gty = etty)loga _ prloga , pyloga _ ot, ay 
and 
loga® = log(e*!8*) = xloga (10.52) 


for allx,y ER. 
Notice that if r ¢ Q, say r = 2, with m € Z andn € N, Corollary 10.48 gives 


r " loga log */a™ n “ 
a’ = en 84 = eB = Va"; 


in other words, for x = r € Q definition (10.51) matches with the meaning we have 
attributed to a” up to now. In particular, we have a® = 1 and a! = a. 
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If a = 1, it follows from (10.51) that a’ = 1 for every x € R. On the other 
hand, given a positive real a ~ 1, the exponential function of basis a is f, : R > 
(0, +00) such that f(x) = a*. 

Relation (10.51), together with the properties of the exponential function (of 
basis e) and the chain rule, guarantee that f, is a differentiable bijection, with 
derivative 


f(x) = exp’!®4- loga = a‘ loga. (10.53) 
In particular, since loga > 0 & a > 1, we get 
fa is increasing @ f/>O0@a>1; 


accordingly, f, is decreasing if and only if 0 <a < 1. 

The inverse of f, is defined to be the function log, : (0,-+oo) — R, which 
is called the base a logarithm function, whose properties will be the object of 
Problem |. Thus, the natural logarithm function is just the base e logarithm function. 

An important variant of the above discussion is the following: given an interval J 
and functions f : J > (0, +00) and g: I > R, we define f(x)*™ by letting 


f( x8 = el) logs) | 


Observe that the requisite of f(x) being positive gives sense to this definition. Also, 
the chain rule applied to the right hand side of the equality above assures that the 
function x € J +> f(x)’ is differentiable whenever f and g are so. In this respect, 
see Problem 9. 

We can now state and prove a useful extension of Theorem 10.50. 


Theorem 10.53 ¢ = lim, 400 (1 + +) 


Proof A reasoning analogous to that in the proof of Theorem 10.50 gives, for x > 0, 


1 1 1 
<logjl+-—-]<-. (10.54) 
1 % x 


X+ 


It thus follows from (10.52) that log (1 + +)" = xlog(1 + 4). 
Multiplying both inequalities above by x > 0, we get 


Sie ie ot 
— <lo -—}) <1, 
x+1 é x 


so that the squeezing theorem yields 


1 x 
lim log (1 + ~) =1. 
x—>+00 x 
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Finally, since the exponential function is continuous, we can write 


1 x 
e = exp( lim toe(1 +=) } 
x—>+00 Xx 
' 1\" 
= lim exp{log({1+- 
X= “FOO XxX 
Ly 
= lim {14+-]. 
x—>+00 Xx 


We now use Theorem 10.51 to show that the exponential function increases more 
rapidly than any polynomial when x > +00. 


oO 


P(x) = 0. 


ex 


Theorem 10.54 [fp is a polynomial, then lim,—+00 
Proof Let p(x) = Ginx" + Gy—x" | +--+ + a,x + ao, with ay, 4 0. Since 


m k 


P(x) _ x 
ex = you , pe 


k=0 


xt 


it suffices to show that lim,-+4.0 a 0 for every n € N. To this end, letting y = e* 
we have x = logy and, then, 


x" _ (logy)" _ (“s) 
e y ay 


Finally, since y — +00 when x — +00, Theorem 10.51 gives 


ae ; logy\' _ logy’ 
lim —= lm — lim =0 
x>too et — y>+o0 \ wy y>too Ay 


Oo 


We close this section by deriving asymptotic estimates on the size of logn and 
n!. In other words, we estimate the sizes of logn and n! for large values of n. Our 
first result is due to Euler.* 


Theorem 10.55 (Euler) Givenn € N, we have 


n 1 
li —-1 =), 10. 
7 (> i cen) y. (10.55) 
where y is a positive real constant. 


“The Swiss mathematician Leonhard Euler is one of the most prolific mathematicians of all times, 
even if we stick only to relevant Mathematics. His contributions impressively vary from Geometry 
to Combinatorics, passing through Number Theory, Calculus and Physics. In each of these areas 
of knowledge there is at least one, if not several, celebrated Euler’s theorem. 
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Proof First of all, observe that 


n—1 


n 1 n 1 
0< dy —logn = 2 = 2 (ostk + 1) — logk) 


k=1 


->(! lo inet ase 
~ ONK é k i 


Now, since + — 0 when n — +0x, it suffices to prove that the series 


1 1 
Y (7-H (14 ;)) (10.56) 


k>1 


converges to a positive sum y. 
Applying inequality (10.49) with k in place of n, we conclude that (10.56) is a 
series of positive terms, such that 


n—-1 


n—-1 
1 1 ee 1 
Y-(—-1og(1+ = y(--——)=1--. 
(5 o8( +P ei) n 


k=1 
Therefore, Proposition 7.44 guarantees that it does converge. Oo 


The positive number y defined by (10.55) is known in mathematical literature as 
the Euler-Mascheroni constant, and its value with five correct decimal places is 
y = 0,57721. Up to this day it is unknown whether y is rational or irrational, albeit 
there is a strong suspicion that it should be irrational. 

Notice that (10.55) can be written as 


n 1 
ene Sy 10.57 
ogn dx y (10.57) 


with error less than 4 (cf. Problem 19). 
In order to motivate what comes next, let n € N. The arithmetic-geometric mean 
inequality (cf. Theorem 5.7) gives 
1 2 Saat n 1\n 
ee (Aa) = (=) 
n 2 


thus, for sufficiently large n, we have 
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On the other hand, since n + 1 < 2(n— 1) < 3(n—2) <--- forn > 3, we get 
(n!)? = (1-n)(2-(n—1))...((n— 1) 2)(n- 1) 
2 
> 2 1 n—2 = ( n ) 1 ts 
> mint I? =(—) +) 
therefore, again for sufficiently large n, 


! 1\n/2 
Ds Gas age 
n 


nl? ~ n+] 


The coming theorem refines the above naive estimates, giving a sharp asymptotic 
estimate for n!. In order to properly state and prove it, it is now convenient to 
introduce some notations. Given sequences (a,),>1 and (b;,)n>1 Of positive reals, we 
write d, ~ by and a, = o(b,) as shorthands for the following possible asymptotic 
behaviors of ae 


an ~ by & Tim <= 1 and ay = (by) lim = 0. 


n>+oo by n—>+00 Dy 
In particular, a, = o(1) if lim,++400 d, = 0. Also, if a > 0 and wa, ~ b,, then 


; b, —aa 
lim ———" =0, 
n—>+oo an 


so that 2—*% — o(1); consequently, 


qy 


by — Ay 


bn = Gn + (Dn — Gn) = (« + Ja = (a + 0(1))an. (10.58) 


an 


We are finally in position to prove the following result, due to the Scottish 
mathematician of the XVIII century James Stirling and known in mathematical 
literature as Stirling’s formula. 


Theorem 10.56 (Stirling) n! ~ /2zn (#)". 


Proof We have to show that lim, +0 (“—*) — Zz or, which is the same, that 
ki \ ne "/270Nn \ 1 
im log | ————— |] = log ——. 
n>+00 . n! . Jn 


Firstly, we shall show that the limit at the left hand side above actually exists. To 
this end, note that 


ne-n ! 1 iP 
log (=~) = nlogn—n—log = (nlogn—n) — tog 
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with 
n—1 


nlogn—n =) [((k + 1) log(k + 1) — (k + 1)) — (klogk — 
k=1 


= pe) logxdx—1 
k=l’ 


(we used the result of Problem 4 in the last equality) and 


1 (i? 1 
5 log aha (log(1 - 2) + log(2- 3) +--+ + log((n — 1)n)) 
7 s logk + log(k + 1) 
k=1 2 
Hence, 
n,—n n—l k+1 lock +1 k 1 
log ne "Jn = Da / log x dx — logk + log(k + 1)\ _ 1. (10.59) 
n!} k 2 


k=1 


For k > 1, let a = fe logxdx — S884) problem 13 (with a = k, 
b=k+1 andf = log, which is concave) furnishes 


keel logk + log(k + 1 
a= logxdx — SEE FUEETD 0, 
k 


On the other hand, by successively applying the results of Problems 4 and 20, we get 


1 k+1 1 
log («+ >)-f logxdx = log («+ 5) - e+ Dog 1)+klogh+ > 0. 
k 


Therefore, 


( ;) logk + log(k + 1) 
2 


1. (+4) 1 
= — log ———— = — log|{ 1 + ————~ 
2 k(kK+1) 2 4k(k + 1) 
1 1 1/1 1 
< =-——— = -| -- —— }, 
2 4k(k+1) 8\kK k+1 


where we used the result of Problem 5 in the last inequality above. 
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We conclude that 5°... , ax is a series of positive terms, such that 


n—-1 n—-1 1 
waa; -aa)= s 


k=1 


Thus, the comparison test assures that ei a, does converge, and it follows 
from (10.59) that there exists 


lim loe(“ we “~—)- lim Ya-t= Dank 


n—>+0o0 n—>+o0o 
k>1 


For the actual computation of the desired limit, let c > 0 be such that )°,.; ax — 
1 = loge. Since 


lim log (=~) = loge, 
n! 


n—>+oo 


loge ne" /n 
c =e" = exp = log a 
n—>+0o n. 


. ( ne" /n ) 
= lim explog | ———— 
n—>-+00 n!\ 


. n'e” n 
= lim hae : 
n—>-+00 ni 


It follows from (10.58) (with a, = n"e~" /n, by = n! anda = +) that 


we have 


= (- + o(t)) n'e" Jn. 
c 


Now recall that, according to Example 10.35, if J, = Hs *(cos x)"dx forn € N, then 
(2k)! « (2k)? 
= ——:— an = = 
7 “(OkRN2 2 AT OK(2K)! 


for every integer k > 1. Hence, 
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In (QK)N)22k x 
Iyn-1  24R(RNA 2 


[(¢ + o@) (2) eo" / TE] ae y. 
74k [(2 le o(1)) ke VE] 2 


20 k 
= as Dc", 


(4 + 0(1)) 


k 
However, Problem 13, page 391 showed that wt — 1. Therefore, the above 


computations give 2c? = 1 or, which is the same, c = prt oO 


Example 10.57 For n € N, the binomial number (*") is the largest of the 2n + 1 
coefficients of the binomial expansion of (x + y)?”. This gives 


2n = 2n 
ann ) ( )=% 
n d k 


2n aq A . . . : 
so that CC > a: Stirling’s formula considerably improves this estimate, for, with 


the aid of it, we get 


2n\ _ (2n)! Vie Ti(2) ge 
( )- eae 


n 


Problems: Section 10.7 


1. * Given 0 < a ¥ 1, prove that log, is an infinitely differentiable bijection, with 


logx j 1 
log, x = —— and log, x = ——., (10.60) 
loga 


for all O < a,x # 1. Moreover, given positive reals a, b, c, x and y, with 
a,b,c # 1, prove also that: 


(a) log = log,. 

(b) log, is increasing if a > 1 and decreasing if 0 < a < 1. 
(c) log, (xy) = log, x + log, y. 

(d) log, c = log, c- log, b. 

(e) log, b-log,c:-log.a = 1. 


16. 
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. Which of e” or x° is bigger? Justify your answer! 

. Find all a,b € N such that a 4 b and a? = b*. Justify your answer! 

. *Compute f logxdx = xlogx—x+C. 

. * Show that log(x + 1) < x for every x > 0. 

. Use Corollary 10.48, together with the result of Problem 8, page 338, to give 


another proof of the fact that log is strictly concave. 


. Find all negative values of a such that f : R > R, given by f(x) = e* + ax’, 


has a single inflection point. 


. *Leta # 0 be areal number and f : (0,00) — R be the function given by 


f(x) = x%. Show that f’(x) = ax! for every x > 0. 


. Given an interval / and differentiable functions f : / > (0, +00) andg:/] —> R, 


compute the derivative of the function h : J] + R defined by h(x) = f(x)8, 
for every x € I. 


. Given a,b > 0, let f : [0, 1] > R be defined by f(x) = x“(1 — x)’. Show that f 


attains a maximum value and compute it. 


. The set of rationals is not closed with respect to powers; for instance, if a = 


b= 3, then a,b € Q, albeit a= ee an irrational number. If a and b are 


positive irrationals, is it always true that a” is irrational? Justify your answer. 


. If ais a positive real, prove that lim,_, 4.90 (1 + 4)" = e*. 
. Find all positive values of a for which lim, +0 (4 y=, 


xX—a 


. Let f : R > R be a differentiable function such that f(0) = 0 and f’(x) > f(x) 


for every x € R. Show that f(x) > 0 for every x > 0. 


. * The purpose of this problem is to compute primitives for the function t b> 


sec? t, where sec denotes the restriction of the secant function to an interval 
of the form (-% + kz, 5 + kx), with k € Z. To this end, do the following 
items: 


(a) Show that sec’ t = tant- sect and 4 log | sect — tant| = — sect. 
(b) Conclude that f sect dt = —log| sect —tant| + C. 
(c) Write sec? t = 1 + tar rand integrate [ sec? tdt by parts to get 


[ sect vat = —log|sect—tant| + secr-tant— [ sec! rdf 
and, hence, 
3 1 
sec tdt = qi sect tant — log | sect —tant|) + C. 


* The hyperbolic sine and hyperbolic cosine, respectively denoted sinh, cosh : 
R — R, are defined by 


ee —e ete 


sinhx = and coshx = 


10.7 Logarithms and Exponentials 413 


~ _ (cost, sint) (cosh t, sinh t) 


Fig. 10.8 Circular and hyperbolic sines and cosines 


17. 


18. 


With respect to them, do the following items: 


(a) Show that cosh? x — sink’ x = 1, sinh’ x = coshx and cosh’ x = sinh x. 

(b) Show that cosh is an even and strictly convex function with image [1, +00), 
such that coshx = 16 x= 0. 

(c) Show that sinh is an odd strictly increasing function with image R. Show 
also that sinh is strictly convex in (0, +00) and strictly concave in (—oo, 0), 
and that x = 0 is its only inflection point. 

(d) Sketch the graphs of the hyperbolic sine and hyperbolic cosine in a single 
cartesian system of coordinates; in this same cartesian system, sketch also 
the graphs of x +> 4e* (x > 0) and xb Se™, x —$e™ (x < 0). 

(e) The graph of the hyperbolic cosine is known as the catenary.° Given xo € 
R, show that the curvature of the catenary at xo (cf. Problem 3, page 344) 


equals k(x) = — 


cosh2xo ° 

The next problem explains the names hyperbolic sine and hyperbolic cosine. 
To put it in context, recall (cf. Fig. 10.8, left) that the area of the circular sector 
of radius 1, contained in the first quadrant of the cartesian plane, centered at 
the origin and bounded by the radii that join the origin to the points (1, 0) and 
(cos f, sin t) equals 5. 
For all t € R, observe that the point (cosh, sinh t) belongs to the right branch 
of the hyperbola x? — y* = 1, as well as to the straight line y = (tanh f)x, where 
tanht = She For a fixed t > 0, let A be the area of the bounded portion of the 
cartesian plane bounded by the horizontal axis, the right branch of x7 — y* = 1 
and the straight line y = (tanh ¢)x (cf. Fig. 10.8, right). Show that A = 


Given A > 0, show that lim,_,94 x4 logx = 0. 


£ 
5 


>For the reader’s curiosity, the surface of revolution generated by the rotation of the catenary along 
the horizontal axis is an example of a minimal surface, in this case a catenoid. Minimal surfaces 
have very interesting properties, some of which can be found in the excellent book of professor 
M. P. do Carmo [8]. 
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19. * With respect to (10.57), show that 


0 <togn~(So 4-7) <4. 


k=1 
20. *Iff: [l, +co) > R is given by 
f(x) = log(x + 1/2) — (+ 1) log&@+ 1) 4+ xlogx+ 1, 


prove that f is positive in [1, +00). 


21. Show that lim,++o = =e. 


22. Let 0 < @ < 1 be given and k,n € N be such that k > an. Prove that a ~ 
(kKt1e)" 
J2nn * 


10.8 Miscellaneous 


This section gathers some interesting applications of the natural logarithm and 
exponential function. We start by examining three examples which show that the 
fact that we now have logarithms and exponentials at our disposal gives greater 
flexibility to the material discussed so far. 


Example 10.58 (Thailand) Find all functions f : N > R satisfying the following 
conditions: 


(a) f is increasing. 

(b) f(1) = 1 and f(2) = 4. 

(c) f(xy) = ff) for all x,y € N. 

Solution An easy induction gives f(2") = 27", for every k € N. 


Take an integer a > 2 and suppose, for the sake of contradiction, that f(a) < 
a? — 1. Since Q is dense in R, we can choose m,n € N such that 


log2?, om log 2? 


2 
log a? Th log(a? — 1) 


so that 2” < a” and (a? — 1)” < 2”". Hence, successive applications of (a) and (c) 
give us 


£2") = 2 > (a — 1)" > f(a" = f(a") > f2"), 


which is a contradiction. 
A similar reasoning shows that f(a) > a? + 1 also leads to a contradiction, so 
that the only left possibility is f(a) = a’. Oo 
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The next example shows how to use logarithms to guarantee the existence of 
powers of a fixed (and almost arbitrary) given basis beginning (from the left) by 
a prescribed set of algarisms. At this point, the reader might find it helpful to 
review the material of Sect. 7.3, specially Corollary 7.33, as well as the statement of 
Problem |, page 411. 


Example 10.59 Let a > 1 be an integer which is not a power of 10, and s be a 
finite sequence of algarisms, the first of which is nonzero. Prove that there exists a 
power of a whose decimal representation starts, from the left, with the sequence s 
of algarisms. 


Proof We want to show that there exist m,n € N for which 
s-10" <a" <(s+1)-10”. 


Taking logarithms in base 10, this is the same as showing that there exist naturals m 
and n for which log; s + m < nlogi) a < log; (s + 1) + mor, which is the same, 


logig s < —m+ nlogi a < logig(s + 1). 


If we show that log,, qa is irrational, the proof will be finished thanks to Corol- 
lary 7.33 of Kronecker’s lemma. 

Suppose, by contradiction, that log,,a € Q, say log;ya = - with p,g ¢ N 
relatively prime. Then a’ = 10’, and the Fundamental Theorem of Arithmetic (cf. 
introduction to Chap. 1), allow us to write a = 2*5'b, for some nonnegative integers 
k,l and some natural b relatively prime with 10. But these two facts give the equality 
2'45!'4h1 = 2?5?, from which we deduce that b = 1, kq = p and lq = p. Thus, we 
conclude that k = J, which would give us a = 2*5* = 10*, a contradiction. oO 


We now refine the kind of argument of the previous example to establish the best 
possible converse of Problem 16, page 11. 


Example 10.60 Prove that there exist infinitely many naturals n such that the 
decimal representations of 2” and 5” start (at left) with a 3. 


Proof Vf 2" and 5” are to begin with a 3, it is necessary and sufficient that there exist 
naturals k and / such that 3 - 10 < 2” < 4-10‘ and 3-10! < 5” < 4- 10! or, taking 
logarithms in base 10, that 


logig 3 < —k + nlogi)2 < logy) 4 and logy) 3 < —1+nlog,)5 < logy, 4. 


In turn, this is equivalent to asking that the fractional parts® of both nlog,) 2 and 
n log), 5 belong to the interval (log 3, log 4). 


Recall that the fractional part of x € R, denoted {x}, is defined as {x} = x — Lx], and belongs to 
the interval [0, 1). 
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Since 1 log)) 2 + nlog)) 5 = n and {n} = 0, such fractional parts are at the same 
distance from 5. Finally, since 5 € (ogi, 3, logy, 4), we conclude that it suffices to 
show that there exist infinitely many naturals k and n such that 


log, 3 < —k + nlogy) 2 < logy, 4. 


However, as we have shown in the solution to the previous example, log), 2 is 
irrational, so the existence of such k and n follows again from Corollary 7.33 of 
Kronecker’s lemma. Oo 


We now turn to inequalities. Up to this moment, we have given or hinted at three 
different proofs of the inequality between the arithmetic and geometric means (one 
in Theorem 5.7, another in Problem 13, page 140, and a third one in Example 9.59). 
Jensen’s inequality applied to the natural logarithm function provides a fourth one, 
which is particularly straightforward. However, before we present it, it is worth to 
point out the following generalization of (10.48), which can be easily established by 
induction: for x;,x2,...,X, > 0, we have 


log(x1x2...X%n) = logx + logx2 + ---+ log xp. (10.61) 


Example 10.61 Letn > 2 be an integer and x1, x2, ..., Xn be positive reals. Since 
log : (0,+00) — R is strictly concave, the version of Corollary 9.75 for strictly 
concave functions gives us 


n 


logx, + logx. +---+ logx, 246 (= + x9 pute), 
n 


If we successively apply (10.61) and Corollary 10.48 (with x = x,x2...x, and 
r= +) to the left hand side, we get 


log w/%ta an < log (ees) (10.62) 
n 
with equality if and only if x, = x. = --- = x,. But since log is an increasing 


function, it follows from here that 


Xy $XQ ++ +X 
n ’ 


I X{X2...Xn ZS 


with equality if and only if xy = x» =--- = Xp. 


Essentially the same idea as that of the proof of the previous example allows one 
to obtain the important Young’ inequality.’ 


7 After William H. Young, English mathematician of the XIX and XX centuries. 
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Proposition 10.62 (Young ) Let p and q be positive reals such that ; + ; = 1, 
Given a,b > 0, we have 


a pd 
ab<x—+-—, (10.63) 
P q 
with equality if and only if a? = b‘. 


Proof Given positive reals x, x2, tf; and t2 > O such that t; +f. = 1, strictly concave 
character of the natural logarithm function in (0,-++oo), together with Jensen’s 
inequality, gives 


log(t)x| + t2X2) => ft log x, + ft log x2, 


with equality if and only if xj = x2. 
Letting t; = 7 b= - x; = a? and x2 = b’, we get 


with equality if and only if a? = b?. 
Finally, since log is strictly increasing, it follows from the above inequality that 


ap bt 
—+—>ab, 
P q 
with equality if and only if a? = b?. Oo 


The inequality of the coming result is a consequence of Young’s inequality and 
is known as Hilder’s inequality.* Note that, when p = q = 2, such an inequality 
reduces to Cauchy’s inequality (cf. Theorem 5.13). Nevertheless, it is only after 
defining a’* for reals a, x > 0 that we can make sense of it. 


Theorem 10.63 (Hélder) Let a), a2,...,d, and bi, b2,..., by, be given positive 
reals, and p,q > 0 be such that * + 7 = 1. Then, 


8 After Otto Hélder, German mathematician of the XIX and XX centuries. 
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Proof Letting A = ()7j-, 4; ue and B = ()vj_ ae we have 


Daa sane D4. 


<1. 


wl > 


Now, letting x; = ¢ and y; = a, we have 


and want to prove that )~"_, x:y; < 1. But this is straightforward from Young’s 
inequality: 


ye n 
Son (fete tg 
i=1 


To have equality, we must have a = y! for | < i < nor, which is the same, 


a; bt ; P 
ay = ga for 1 <i <n. Yet in another way, we must have 
P P D Pp 
eee ce 
bi bg bi «Ba 


Conversely, it’s immediate to verify that if the above condition is satisfied then we 
do have equality. Oo 


A version of Hélder’s inequality for integrals will be the object of Problem 8 of 
the next section. 

We finish this section by discussing two more elaborate applications of Jensen’s 
inequality, the first of which appeared as a problem at the 2001 IMO. We divide it 
into two examples. 


Example 10.64 Sketch the graph of the function f : R > R given by 


1 
V1 + 8e 


Solution Firstly, note that 0 < f(x) < 1 for every x € R. Moreover, since 
limy-++400 &* = +00 and lim,-.—~o e* = 0, we have 


f(x) = 


lim f(x) =0 and lim f(x) =1. 
xX—>+00 X—>—0CO 


It follows that the graph of f is entirely contained in the strip of the cartesian plane 
bounded by the horizontal lines y = 0 and y = 1, and that these lines are asymptotes 


10.8 Miscellaneous 419 


to the graph. On the other hand, since f(0) = i, the graph intersects the vertical axis 
at the point (0, 3). 
Now, an immediate computation with the aid of the chain rule gives 


4e* 


f= "G+ bez <0, 


for every x € R, so that f decreases along the whole real line. Also, computing f” 
(again with the aid of the chain rule) we get 


4e* 


“Ta 82 Fey (1 — 4e’). 


f"@) = 


Therefore, the sign of f” coincides with that of 4e* — 1, so that f is strictly convex 
in (—2 log 2, +00), strictly concave in (—oo, —2 log 2) and has x) = —2 log 2 as its 
only inflection point. 

Finally, noticing that —2 log2 ~ —1.38, f(—2 log 2) = Fz =~ 0.58 and gathering 
together the above information, we get Fig. 10.9 as a reasonably accurate sketch for 
the graph of f. 

oO 


We now come to the promised IMO problem. 


Example 10.65 (IMO) Prove that, for every positive reals a, b and c, one has 


a b c 
———— + — + ————— 
Va+be Vb? +ac Vc? +ab 


Proof Note, first of all, that the expression at the left hand side of the inequality 
equals 


1 1 1 
——— + ——— + —_.. 
yl+8% 1+8%5 /1+84 


Fig. 10.9 Graph of the function x F> TE 
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Letting os = e, 5 = e and % = e we have x + y+ z = 0 and wish to 
prove that 


f@) +f0) +f@ = 1, 


where f : R — R is the function of the previous example. To what is left to do, 
there are three possibilities: 


(i) x,y,z > —2 log 2: Jensen’s inequality gives 


x+ty+zZ 


Fx) +f) +f@ = ar( 5 


) = 3f(0) = 1. 
(ii) x,y < —2log2 < z: then 
2 
SO) +f0) +£@ = f@) + FO) = 2f(-2 log 2) = a 1. 


(iii) x < —2log2 < y,z: again by Jensen’s inequality, we get 


yz 
2 


f(x) +£0) +f =/@) + 2 ( ) = so) +4(-3). 


If g : (—oo, —2 log 2] — R is given by 
x 
a(x) =f) + 2 (-5). 
observe that 
lim g(x)= lim f@~)+2 lim f(y) =1 
xXx—->-—00 x00 x—2>+00 


and 


1 2 
—2log2) = —=+— = 51. 
at VS NAT 


On the other hand, 


4e* 4e*/2 


x 

/ — i far’ 24 (meres = ee eae ste es ee) 
g(x) =f'x) -f ( 5) ( 8ex)3/2 + (i 8e—2/2)3/2" 
so that g’(x) < 0 if and only if —2log7 < x < —2log2. Hence, 


x € (—o0, —2log7] > g(x) > 1 
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and 
x € (—2log7, —2log2) > g(x) > g(—21log2) > 1. 
oO 


The subsequent discussion derives an inequality which widely generalizes? that 
between the arithmetic and geometric means and is known as the inequality of 
power means. We start by defining what one means for such power means. 


Definition 10.66 Given positive reals a), a2,...,d, anda € R, we define the a-th 
power mean of a), dz, ..., a, as the positive real number My = Ma(a),...,4n), 
such that 


(tweet yi" PaO 


M, = n 
“faja2...dn, fa = 0 
Theorem 10.67 Let a\,az,...,d, be given positive reals. If a < pB are any 
reals, then 
min {a;} < My < Mg < max {aj}, (10.64) 
l<i<n l<i<n 


with equality in any of the inequalities above if and only if all of the a;’s are equal. 


Proof We can assume, without loss of generality, that not all of the a;’s are equal. 

Let’s momentarily assume that (10.64) is true (with strict inequalities) for all 
0 <a < B. Given reals x < y < 0, we have —x > —y > 0; since aie are also 
positive and not all equal, it follows from our assumptions that 


; 1 1 1 1 1 1 
min ;—; <M_4y,[—,...,—]<M_.{[—,...,—]< maxj—?. 
lsix<n ( Qj; “Nay an a an ISi<n ( Qj 


Now, an easy computation gives 


1 1 
M_, (— oa) = Mig ost 


, 
a\ an 


for every t € R, so that the above inequalities are equivalent to 


= -l1 
(sna tai) < M,(a,,...,an)~! < M,(ay,...,an)7! < (i (ai) 
l<i<n ; 


° Another such remarkable generalization, due to Newton and McLaurin, is the object of Sect. 17.2 
of [5]. 
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or, which is the same, 


min {aj} < M,(a,...,Gn) < My(a1,...,an) < max {aj}. 
l<i<n l<i<n 


Hence, we conclude that it suffices to analyse the cases 0 < a < B and0 =a < Bf. 
Also without loss of generality, assume that aj = max)<j<y{a;}. Since not all of 
the a;’s are equal, we get 


Fee Serer 2 ay Ene ea 
Mp(a, +12 yn) = (a —_4| < (=) HS a= max {ai} 


and, analogously, min) <i<n{a;} < Ma. Thus, we are left to showing that 
0<a <B> Mo < My < Mg. 
The first inequality above is equivalent to 


ay tas te ta 


> Vaya, ...a% 
: 14 ’ 


which, in turn, follows immediately from the inequality between the arithmetic and 


geometric means. 
In order to prove the second inequality, letting Mg = K it suffices to show that 


1 
ay (a cea 
n 


or, which is the same, 


(Ga en ee)" 5 


n 


Letting b; = (a;/K)*, we conclude that not all of the b;’s are equal, but satisfy 


bitbote-t+b, 1 faptast+e-tap aoe 
n "KB ac 


moreover, since (a;/K)* = py p 


I 
(=| : saa 


, we want to show that 


(10.65) 


n 
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To what is left to do, let f : (0, +00) — R be given by f(x) = x%/8. Since 
0 < a/B < 1, two applications of the result of Problem 8, page 412, give 


f'@= r (5 = i) x2/B-2 < 9) 


so that f is strictly concave. It thus follows from Jensen’s inequality that 


n 


AOpen t iO) (AAP) py =i 
” n : ‘ 


which is exactly (10.65). oO 


Given d),d2,...,d, > O, it is worth noticing that the function f : R > R, 
defined for wa € R by 


f(a) = Mg(a1,..-5 an), 


is continuous. Actually, f is obviously continuous at any a 4 0, so that it suffices to 
show that 


lim f(@) = f(0). 


To this end, the continuity of x +> e* and x +> logx assures that we need only 
show that 


lim logf(a) = log f(0) 


or, which is the same, 


et) ee (10.66 


1 
lim — log ( 
a>0 a n 


However, since 


n 
af bal te taf 


lim log (A) =log1 =0, 
a0 n 


an application of I’H6pital’s rule to the left hand side of (10.66), together 
with (10.53), gives 
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1 Ol Ol Pree Ol d Ol Ol ‘a hatea Ol 
iin 6e (ee) Sites ij} 
a>0 a n n 


_ Mog a)af +--+ + (log a, ay 
= Lim —-_____——* 
a>0 ay +-+-+a% 


loga,; +---+ loga, 


n 
= log(ajaz... dn) 
= ; : 
Problems: Section 10.8 
1. Prove that equation a? = 2* has exactly three real roots, one of which is 
irrational. 
2. (Israel) Find all real solutions of the system of equations 


x + log(x+ Vx? + 1) =y 


y+logiy+ vy +)=z. 
ztlogz+V2+1) =x 


. (Croatia) Prove that there exists no polynomial function p : R — R such that 


p(x) = logx, for every x € R. 


. Givenn € N, show that the decimal representation of n has exactly |log,,7|+1 


algarisms. 


. (Brazil) Let m € N and aj, a2,...,dm € {0,1,2,...,9}, with a; ~ 0. Given 


k €N, prove that there exists n € N such that the decimal representation of n‘ 
starts (from the left) with the sequence of algarismos a)d2... dn. 


. (Russia) Show that, for any reals 1 < a < b <c, one has 


log, (log, b) ob log, (log, c) + log, (log. a) > 0. 


. (Brazil) Give an example of a function f : [0,-++co) — R such that f(0) = 0 


and f(2x + 1) = 3f(x) + 5 for every x € R. 


. (South Korea) Find all functions f : Z; — Z¥+ satisfying the following 


conditions: 


(a) 2f(m? + n*) = f(m)? + f(n)’, for all m,n € Z4. 
(b) If m,n € Z4, with m > n, then f(m7) > f(n?). 


. (Romania) Given a > 0, show that 


1 
x" 1 
lim nf ax = tog (= i} 
n>+oo Jog x" +a a 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


Let x, y and z be positive reals whose sum of squares equals 8. Prove that the 


sum of their cubes is greater than or equal to 16,/2 ; 

Prove the following generalization of the inequality between the arithmetic 
and geometric means, known as the weighted arithmetic-geometric means 
inequality: given positive reals a), a2, ..., dy, and naturals ki, ko, ..., k, such 
that + etect E = 1, we have 


k k 
a; ais: ky 
Sor ore se = a1d2...4n, 
ky ky Kn 


with equality if and only if aj = ay =... = Gy. 
(USA) Given positive real numbers a, b and c, prove that 


+b+ 
tbPct > (ey ° 
= ; 


with equality if and only ifa =b=c. 
(Leningrad) Let x1, x2, ..., x, be nonnegative reals with sum equal to 5. Show 
that 


“1-4; 2n—1)\" 
[| —< ———< J 
ttt t+xj 7 \Qn+1 
j=l 


Let 0 < x1,%2,%3,x4 < a be real numbers such that x; + x2 + x3 + x4 = 27. 
Prove that 


sin x; 16 
ye < 4s 
= Ee 


x; 
i=1 : 


(IMO shortlist) Let a, a2, ..., ad, be real numbers greater than or equal to 1. 
Prove that 


1 1 1 n 
a ee ee a 
a,t+1 aj+1 Qn +17 *wayan...d,+1 


(IMO shortlist) Let a), az, ..., a, be real numbers in the interval [0, 5]. with 
a, +a. +---+ad, < 1. Prove that 


ajda2...a,[1 — (a, + ag +--+ + a,)] 2 1 


(a, + ay +++ + a,)(1—a1)(1 — az)... 1 —a,) 7 ntl’ 
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17. Prove Minkowski’s inequality'®: given an integer k > 1 and positive reals a, 
dy, ..-, An, bj, bo, ..., by, We have 


with equality if and only if a = S Soe B®, 


10.9 Improper Integration 


The purpose of this section is to extend Riemann’s integral to functions which are 
not necessarily bounded and are defined in arbitrary intervals. 

If 7 C R is an interval, then / is of one of the forms [A, B], (A, B), [A, B), (A, BI, 
(A, +00), [A, +00), (—00,A), (—oo, A] or (—00, +00) = R, with A,B € R. In 
order to avoid the need of examining each one of these cases separately, and for 
general notational convenience, we: 


(i) shall refer to the real numbers A, B, or to the symbols —oo, +00 as the endpoints 
of I (albeit —oo and +00 are just formal symbols, and not real numbers), which 
will be generically denoted by a and f; 

(ii) impose that —co < A, B < +00, forall A,B eR. 


Thus, if J has endpoints a and f, witha < f, then 


(a, B) CI and I\ (a, B) C {a, B}. 


For instance, if J has endpoints a = A and B = +00, we have (A, +00) C J and 
I\ (A, +00) C {A, +00}, so that J = (A, +00) or [A, +00). 

Let / be a fixed interval and f : J — R be a given function, such that f is 
integrable in each interval [a,b] C J. (In particular, if J = [a,b], then we assume 
that f is integrable in J.) This implies that f is bounded in each interval [a,b] C J, 
albeit it may well happen that f is not bounded in J. (Nevertheless, recall that if f is 
continuous in /, then, even if f is unbounded in J, Weierstrass’ Theorem 8.26 assures 
that it is bounded in each interval [a, b] C J.) 


Definition 10.68 Let J Cc R be an interval with endpoints w and 6, witha < B, 
and f : J > R be integrable in each interval [a,b] C I. We say that f is integrable 


10 After Hermann Minkowski, German mathematician of the XIX and XX centuries. Minkowski 
made several seminal contributions to different branches of Mathematics. His geometric method 
on Algebraic Number Theory proved to be quite fruitful for the development of that theory. Also, 
Minkowski’s four dimensional space is the correct setting for the geometric description of special 
relativity. 
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in / if, for some xo € (a, B), the limits 


XO iy 
lim / f(t) dt and lim, / f(t) dt (10.67) 
xa Jy YB J xo 


do exist. This being the case, we define the improper integral of f in J, denoted 
LPO dt, by setting 


B xo y 
/ f(t) dt = lim / f(t)dt+ ay / ff dt, 
Py xa Jy yo xo 
and say that the improper integral is f(t) dt converges. 
It is important to realize that the former definition does not depend on the chosen 


Xo € I. More precisely, suppose that x9 € J is chosen so that the limits (10.67) do 
exist, and let Xp be another point in 7. Then, 


[rou- [roas [roa 


/ 
so that the limit limy+a /” “0 f(t) dt also exists, with 


xo xo A 
lim / f(t) dt = lim i f@dt+ / f(t) dt. 
xa x xa x x0 
Analogously, lim,_,g i J (t) dt does exist, with 
= 0 
y x0 y 
lim | f()dt= / f(t) dt + lim / F(t) dt. 
y>B XG xo y>B xo 
Therefore, since [ %9 f(dt + [” f(d) dt = 0, we get 
0 


xo y Xo y 
lim / f() dt + lim, / f(t) dt = lim / f()dt+ lim / f(t) dt. 
xa x y> xh xa x yo x0 


On the other hand, in case J = [a, b], we claim that the definition of integral given 
in Chap. 10 coincides with the present one. In order to check that, let f : [a,b] > R 
be (bounded and) integrable in [a, b], and fix x) € (a,b). Then f is integrable in 
[a, Xo] and, for x € (a, xo), we have 


/ plo) dt = : "FQ at — i: "pd dt. 
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Letting M be an upper bound for |f| in [a, b], it follows from the triangle inequality 
for integrals and Proposition 10.12 that 


[roa 


so that the squeezing principle gives lim,_,, J ‘ f(t) dt = 0. Therefore, 


lim / ” 60) dt = / ” £0) dt — lim / “$(0 dt = i ” £0) de. 


Since f is also integrable in [xo, b], a similar reasoning allows us to conclude that 


< [ Y@ldt <M(x—a), 


y b b b 
tim [Far = bs foo ar— tim | f@dt= [ so. 


Finally, by invoking Proposition 10.21, we get 


lim / ” FO) dt + lim / ; f(dt= / ” F(t) dt + i" f()dt= 7, , f() dt. 


Thanks to the above discussion, we can restrict the study of improper integrals 
to the cases in which / = (A, B], [A, B), (—oo, B] or [A, +00), with A, B € R. Along 
the rest of this section, we stick to the cases J = [A, B) and [A, +00), leaving to the 
reader the task of formulating the analogous results when J = (A, B] or (—oo, B]. 

Let’s start by looking at two examples. 


Example 10.69 The improper integral i e ‘dt does converge. More precisely, 
we have i etdt=1. 


Proof For x > 0, it follows from the FTC that 


x t 
: e'dt=-—e" 
0 


t=0 


eh ee 


Now, Theorem 10.54 gives lim,++o0 e * = 0, so that i; e'dt=1. oO 


Example 10.70 The improper integral /; had t* dt converges if and only ifa < —1. 


Proof For x > 1 anda 4 —1, note that 


x 1 
i ‘had dt = — etl 
1 a+l1 


o+l _ (atl) log x 


t=x 1 
= —_(°t! - 1), 10.68 
oe 7@ ) (10.68) 


Now, since x' , we have (again by Theorem 10.54) that 


lim x* 
X—> +00 


41 __ f0, ifa+1<0 
~ | +oo0, ifa+1>0° 
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Hence, the right hand side of (10.68) has a (finite) limit when x — +00 if and only 
ifa+1<0,ie., if and only ifa < —1. 
Finally, if « = —1, then 


x 1=x 
/ t | dt =logt = logx > +00 
1 t=1 


when x > +00. oO 
Also with respect to the previous example, since 


Ol 


Z x : 
lim —= lim xt! =0 
x—>+00 xl X—>+00 


if and only if a < —1, we use to say that Pies t* dt converges if and only if x* 
decays to zero more rapidly than x! when x —> +00. See also Problem 3, which 
deals with the case i (dt. 

The coming theorem is the fundamental result on the convergence of improper 
integrals. Note that it is the analogue, for improper integrals, of Theorem 7.27. For 
its statement, as well as for remark just following its proof, the reader may find it 
convenient to read again Problem 8, page 218. 


Theorem 10.71 Let f : [A, +00) — R be integrable in each interval [a,b] C 
[A, +00). The following assertions are equivalent: 

(a) f has f(t) dt converges. 

(b) For every € > 0, there exists M > A such that x|,x. > M => Se fO dt <é. 


(c) There exists L € R such that, for every sequence (Xn)n>1 in [A, +oo) and 
satisfying liMy—++o0 Xn = +00, one has limy++00 ii f(jdt=L. 


Proof Let F : [A, +00) — R be defined by F(x) = fi f(0) dt. 

(a) = (b): assume that haa kG dt converges, with vad dt = L. By 
definition, we have lim,_, +40 F(x) = L, so that, given e > 0, there exists M > 0 for 
which 

€ 
x>M=> |FQ)-L| < % 


Hence, for x,,x. > M, triangle inequality gives 


i : f(t) dt 


= |F(@2) — F(@x)| 
< |FQ@2) -L| + |L—Fx)| 


Pre: 
-~-+ =e. 
2 2 
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(b) = (c): assume that condition (b) holds, and let (x,),>1 be a sequence such 
that x, => A for every n > 1 and lim,-+460 X, = +00. Given € > 0, take M as in the 
statement and ny € N such thatn > np => x, > M. Then, for m,n > no, we have 
XmsXn > M, so that 


Xm 


f(t) dt 


Xn 


|F (Xm) — F(x,)| = <€. 


Therefore, (F'(%n))n>1 1s a Cauchy sequence, thus convergent, by Theorem 7.27. 

If L = limp++o0 Fin), we claim that lim,++00 F(x’,) = L for every sequence 
(x), )n>1 Satisfying x), > A forn > 1 and lim,++o0 x1, = +00. Indeed, the discussion 
in the previous paragraph (with (x/,),>1 in place of (x,)n>1) assures the existence of 
L’ = limy++00 F(x,). However, if it was L’ # L, then the sequence (x”’),>1, given 
fork > 1 by xj, = xox-1 and x5, = x4,, would also converge to +00 and be 
such that 


. ” _ : nN a a 
im, F(xx,_,;) = L and im, F(x;,) =L'. 


Since L # L’, the limit lim,++o0 F(W/) would not exist, which contradicts the 
argument of the previous paragraph (this time with (x’),>1 in place of (%n)n>1). 

(c) = (a): suppose that there exists L ¢€ R such that, for every sequence 
(%n)n>1 in [A, +00) with limy++o0%, = +00, we have limy++400 F(%,) = L. If 
lim,—++00 F(x) does not exist or is different from L, then there exist « > 0 and, for 
every B > A, areal number xg > B such that |F (xg) —L| > €. Taking B successively 
equal toA+ 1,4 +2,A+3,..., we would get a sequence x, x2, x3, ... Satisfying 
X, > A+nand |F(x,) — L| => € for every n > 1. Therefore, limp++40 X, = +00 
but (F(%n))n>1 does not converge to L, which is an absurd. oO 


Remark 10.72 It is immediate to adapt the previous result to the case of a function 
f : [A,B) — R which is integrable in each interval [A,b] with A < b < B. 
More precisely, and as the reader can easily verify, the following assertions are 
equivalent: 


(a) f i f(t) dt converges. 

(b) For every « > O, there exists 6 > 0 such that B-—6 < x12 <B> 
| £2 FO dt| Ke. 

(c) There exists L € R such that, for every sequence (x,),>1 in [A, B) and satisfying 
litt+ hoovy = B, onehas limty-s4ng ffl dt = L. 


We now establish an important consequence of the previous theorem, which is 
known in mathematical literature as the comparison test for improper integrals. 


Proposition 10.73 Let f, g : [A, +oo) > R be such that |f(x)| < g(x) for every 
x € [A,+00). If f and g are integrable in every interval [a,b] C |A, +00) and 
if a g(t) dt converges, then [ Nas f(t) dt also converges and 


A 
+00 
d dt. 
flo) at] < i) g(t) dt 


| +00 
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Proof Firstly, notice that g(x) > 0 for every x > A, so that /' (ria g(t) dt > 0. Then, 


given € > 0, the convergence of iia g(t) dt, together with Theorem 10.71, assures 
the existence of M > A such that 


XQ 
X1,X2 > uf g(t) dt <€«. 
XxX] 


Now, the triangle inequality for integrals, together with Proposition 10.12, 
guarantees that, also for x1, x2 > M, we have 


i : f(t) dt 


Then, once more from Theorem 10.71, we get the convergence of the integrals 


f° FO dt and f° |f@| dt. 
Finally, it follows from what we did above, together with the properties of limits 
of functions (cf. Sect. 9.2) that 


< i If (t)|dt < iz g(t) dt <e. 


x1 


f(t) at 


| +00 


lim / "f(t dt 
A 


X—> +00 


= ii t) dt 
tim | [70 


lim a l@|dt < lim / “g(t dt 
X>+00 JA X—>+00 JA 


g(t) dt. 


IA 


II 


oO 


Remark 10.74 As in the previous remark, the former proposition can be easily 
adapted to deal with the case J = [A,B). More precisely, if f,g : [A,B) > R 
are integrable in each interval [A, b] with A < b < B, and such that |f(x)| < g(x) 
for every x € [A, B), then the convergence of // : g(t) dt implies that of Fi f(t) dt; 


moreover, one also has 
B 
[roa 
A 


Example 10.75 We shall use Proposition 10.73 to establish the convergence of 
Dirichlet’s integral 


B 
<[ soa. 


432 10 Riemann’s Integral 


To this end, start by observing that the fundamental trigonometric limit (cf. 


Lemma 9.11) assures that the function tf b> oh extends continuously to 0. 


Therefore, it suffices to establish the convergence of Reece 


To what is left, let x > 1. Integrating by parts, we get 


* sint cos ft |f=* * cost 
— dt = ——— — —— dt 
1 t t t=1 1 ie 
cos x * cost 
= cos 1 — - — dt. 
Xx 1 2 


Since | cosx| < 1 for every x, the computations above guarantee that i, oo ae dt 


co COST 


converges if and only if /, hy dt converges. Hence, we have reduced our qk 
to the convergence of this last i improper integral. 


Finally, it is enough to apply the comparison test, noticing that | 


cos x 1 
| < < = and 


(according to Example 10.70) i = “ = dt does converge. 


Problem 10, page 436, brings another proof of the convergence of Dirichlet’s 
integral. On the other hand, it is possible to show (cf. [10, 20] or [27], for 


instance) that 
+00 .: 
sin t 4 
/ Se 
0 t 2 


In turn, this computation is of fundamental importance to the theory of Fourier 
series.'' Figure 10.10 sketches the graph of x +> “2, 

Back to the development of the theory, Proposition 10.73 assures that if f : 
[A, +o0o) — R is integrable in each interval [a,b] C [A,-++oo) and rhea If (t)| dt 


converges, then // rise f(t) dt also converges and 


+00 


f(t)dt] < If(t)| dt (10.69) 


| +00 


(recall that |f| is also integrable in each interval [a,b] C [A, +00), by Proposi- 
tion 10.14). In such a case, we say that /) eae | (t) dt is absolutely convergent. Not 
every convergent improper integral is absolutely convergent; actually, Problem 11, 
page 436, shows that 1 ie sn dt is not absolutely convergent. 

We finish this section by showing that the link between the theory of convergent 
series and convergent improper integrals is actually deeper. More precisely, the 
coming result establishes an equivalence between the convergence of the integral 
of a positive function and that of a certain series. For this reason, it is known as the 
integral test for absolute convergence of series. 


"After Jean-Baptiste-Joseph Fourier, French engineer and mathematician of the XIX century. For 
more on Fourier see the footnote on page 469. A glimpse on the theory of Fourier Series is the 
content of Problems 11 to 17, page 468. 
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Theorem 10.76 Let no € N and f : [no, +00) — R be a decreasing function, 
integrable in each interval [a,b] C [no, +00) and such that lim,-++4o0 f(x) = 0. 
Then 


+00 
/ f(t) dt converges y S(k) converges. 
no k>no 


Proof Let g,h: [no, +00) — R be the functions given, in the interval [k, k + 1) (for 
each k > ng), by g(x) = f(k + 1) and h(x) = f(A). 

The conditions on f guarantee that 0 < g(x) < f(x) < A(x) for every x > no. 
Hence, the comparison test for improper integrals assures that 


+00 +00 
i f(t) dt converges > / g(t) dt converges 
no no 


and 


+00 +00 
/ h(t) dt converges => f(t) dt converges. 
no 


no 


On the other hand, for an integer n > no, we have 


n n n n—1 
[ soa= 1 nd [ nyar= Pre. 


no k=no+1 no k=no 
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Finally, let’s apply equivalence (a) < (c) of Theorem 10.71, with x, = k for 
every k > no: 


(i) If 190 dt converges, then | g(t) dt converges and, hence 


+00 n n 
fo eoar= tim, i andr = tim, FH= Yo FW. 


no k=not+1 k>no+1 


Therefore, >°;5,,..f(k) converges, with }°,.,,f(k) = f(no) + emeil @- 
(ii) If >°,,,,f(&) converges, then 


foe) n—-1 
/ ° h(t) dt = lim ah h(t) dt = lim >> fk) => f®,. 


k=ng k>no 


so that AD ie h(t) dt converges. Hence, what we did before assures the conver- 


gence of Sr" Fo) dt. 
oO 


Example 10.77 The integral test allows us to easily establish the divergence of the 
harmonic series, as well as the convergence of the series et be for every real 
number af > 1. For the harmonic series, letting f : [1,+00) — R be given by 
f(x) = -, we have 


+00 1 xy 
i —dt= lim —dt= lim logx = +o. 
1 t x>+00 J, ft x—>+00 


Analogously, taking f : [1, +00) > R given by f(x) = > with r > 1, we have 


saa | ; 1 1 1 
—dt= lim —dt= lim —-l1)J= ; 
1 t” x>too J, f x>too l—r \x! r—1 
Let’s now see one more interesting example. 


Example 10.78 Examine the convergence of the series )~ ke? not: 


Solution Let f : [2,-+oo) — R be given by f(x) = 
decreasing and such that lim, +0 f(x) = 0. The integral test says that )~ ko? [oak 


or so that f is continuous, 


converges if and only if (tas sts dt also does. 
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To what is left, note that the substitution of variable s = logt (cf. Problem 2, 


page 435) gives 
+00 +00 1 +00 1 
f(t) dt -| a= [ — ds 
2 2  tlogt log2 5 
= lim (ogs—log2) = 
sS—>+00 
Hence, | as f (t) dt diverges, so that as a also diverges. oO 


Problems: Section 10.9 


1. 


Nn 


Examine the convergence of the following improper integrals: 
(a) f, sec t dt. (c) i wat dt. 
(b) f° e~* sin tdt. (d) [7S pirat. 


. * Prove the following version of the change of variables theorem for 


improper integrals: let f : [A, -+-coo) > R be continuous and g : [B, +00) > 
[A, +00) be differentiable, with lim,.+.0 g(x) = +00 and g’ integrable in 


each interval [a,b] C [B, +00). Then, se f(t) dt converges if and only if 


f es f(g(s))g'(s) ds converges. Moreover, if this is so, we also have 


+00 +00 


f()) dt = 5 f(g(s))g'(s) ds. 


g(B) 


. * Show that ibs t* dt converges if and only ifa@ > —1. Then, fora < 0, conclude 


that fi (” dt converges if and only if x” grows to +00 more slowly than x~! as 


x —> 0+. 


. (Berkeley) Prove that both integrals Is cos(x*)dx and i sin(x)dx converge. 
. (Berkeley—adapted) Show that: 


(a) Jy log(sinx)dx and fy log | cos.x|dx converge. 
(b) fo log(sinx)dx = fy log| cos x|dx = —z log 2. 


. Examine the convergence of the following series, where aw > 0 is a given real 


number: 


1 
(a) Ver Cog hy me . 
(b) ae Woe : 


(c) Vir WogEiseoe 
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. Show that io #8" dt converges and compute its value. 


i¢e 
The next problem extends the results of Problem 12, page 381, and of 


Theorem 10.63. 


. * Let C R be an interval with endpoints a and f, f,g : I > (0,+00) be 


integrable functions and p, g > 0 such that : + a = 1, 


(a) If f? fo? dt = f? g(f)% dt = 1, show that f? f(Og(f) dt < 1. 
(b) Deduce Hélder’s inequality for integrals: 


B B 1/p B 1/q 
/ Fie(d) dt < ( / ror) ( i cova 


. In the proof of Theorem 10.55, we saw that in order to guarantee the existence 


of lim, +00 (yi i — log n) it suffices to assure the convergence of the series 


> (; —log(k +1) + logk} 


k>1 


To this end, let f : [1, +00) — R be given by f(x) = + —log(x + 1) + logx for 
x => 1, and do the following items: 


(a) Show that f is decreasing, with lim,_,40 f(x) = 0. 
(b) Apply the integral test to establish the convergence of the series. 


The purpose of this problem is to give another proof of the convergence of the 
improper integral re st dt. To this end, do the following items: 


(a) Prove that it suffices to show the convergence of the series }~ kel sok 
(b) In turn, use Abel’s criterion (cf. Problem 18, page 243) to show that this 
amounts to establishing the boundedness of the sequence (s,,),>1, such that 

n+1 


n . 
Sp = Dope, sink forn EN. 
sin 4 sin = 


(c) Apply the result of Problem 10, page 359, to find s, = —*+*——~ and, 
sin z 
hence, |s,| << —+. 
sin > 


. Do the following items: 


(a) Show that at least one of the numbers | sin x], | sin( + 1)| or | sin(a + 2)| 
is always greater than or equal to 5. 


(b) Conclude that the series }°,.; isin dl diverges. 


(c) Show that A ls ont dt is not absolutely convergent. 
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12. Ifp) = 2 < po =3 < p3 = 5 <-:-- is a sequence of prime numbers, 
a theorem of Euler states that the series a + diverges.'* In order to show 
this, do the following items: 


(a) Show that (1 — x)! < e** for0 <x < }. 
(b) Forn € N, let py < po < --- < p; be the prime numbers less that or equal 
to n. Show that 


Eien) 


(c) Use the results of (a) ue (b) to prove that ae 1 i < eis 1 PE 
(d) Conclude that }~ kel = diverges. 


10.10 Two Important Applications 


We recall that our initial motivation for the study of the integral was geometric, 
namely, to define and compute the area of the region of the cartesian plane under the 
graph of a given nonnegative (integrable) function. We could also use the concept 
of integral to deal with the following geometric problems: 


(1) How to define and compute the /ength of the graph of f : [a,b] > R? 

(II) In case f is positive, how to define and compute the volume of the solid of 
revolution generated by the rotation of the region under the graph of f around 
the horizontal axis? 

(III) Yet in the case of a positive f, how could one define and compute the area of 
the surface of revolution generated by the rotation of the graph of f around the 
horizontal axis? 


In this section we analyse Problem (I), leaving the analysis of Problems (II) and 
(IID) to [4]. As a byproduct of our discussion, we will present a thorough definition 
and computation of the length of a circle. 

Then, let f : [a,b] — R be continuous. Fix (cf. Fig. 10.11) an interval [c, d] C 
(a, b), a partition P = {c = xy < x1 < +++ < x, = d} of [c, d] and let Ax; = xj—x)-1 
forl <j <k. 

For |P| sufficiently small, it is a reasonable guess to think of the line segment 
joining points (x;-1,f(4j-1)) and (4;,f(4))) as a good approximation for the portion 
of the graph of f situated between the vertical lines x = x;, and x = xj. If this is 
so, it is also reasonable to assume that the length ¢; of such a segment should serve 
as a good approximation for the /ength of this portion of the graph of f, whichever 
way we define it. 


"For a sligthly different proof, see Chap. 9 of [5]. 
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Fig. 10.11 Approximating the length of a graph 


This being said, the formula for the distance between two points of the cartesian 
plane gives 


= 


Ga)? + Ce) =F Ge) P 


2 
- fis (DLN a, 
Xj — Xj-1 
Now, if f is continuously differentiable in (a, b), then Lagrange’s MVT guarantees 


the existence of & € (4-1, xj) such that ee = f'(&); this way, 
i 


£; = 1 + f'(§&)7 Ax; =,/1 + f'(§)? Qj — Xj-1). 
Adding up the above expression for €; for 1 <j < k, we conclude that 
k 
So VL + P(&)P2Q5 — 4j-1) (10.70) 
j=l 
would be a reasonable approximation for what we would think of as the length of 


the graph of the restriction of f to the interval [c,d]. Moreover, we could equally 
hope that such an approximation would get better and better as |P| — 0. 


On the other hand, since (10.70) coincides with the Riemann sum 


X(V 14+ (f)?; P:), 


it follows from Riemann’s Theorem 10.7 that 


k d 
y yi+r Gray — [ V1 +f’ (dx 
j=l : 


as |P| > 0. 
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The discussion above allows us to conclude that i . 1 +f’ (x)*dx is the only 
reasonable way of defining the length of the graph of the restriction of f to the 
interval [c, d]. If we now notice that the union of the graphs of such restrictions (as 
[c, d] varies over all bounded and closed subintervals of (a, b)) coincides with the 
graph of f but the points (a,f(a)) and (b,f(b)), we are naturally led to state the 
following 


Definition 10.79 Let f : [a,b] — R be continuous in [a,b] and continuously 
differentiable in (a, b). If the improper integral 


b 
e -| 1+f'@2dx, (10.71) 


does exist, then we say that the graph of f is rectifiable and has length £. 


Yet with respect to the former definition, recall that 


b-e 


b 
/ V1I+f'(x)2dx = lim V1 +f'(x)?dx. 
a EF" Jate 


Therefore, if f is continuously differentiable in [a, b], then Problem 11, page 380, 
assures that (10.71) holds in the ordinary sense (i.e., not as an improper integral). 


Example 10.80 Let T be a circle of center O and radius R, and choose a cartesian 
system of coordinates such that O(0,0). If f : [—R,R] > R is given by f(x) = 
VR? — x”, then the graph of f is the semicircle of I" situated on the upper half- 
plane of the cartesian plane. Note that f is continuous in [—R, R] and continuously 
differentiable in (—R, R). Hence, according to (10.71), the length of equals twice 
the value of the improper integral 


R 
/ V14+f'(x)*dx, 
-R 


provided it exists. 
To see that this is indeed the case, first note that 


2 
ial x —x 7 R 
VIFFOR = [1+ (5) = zS. 


Now, performing the trigonometric substitution x = R cost and observing that —R+ 
€<x<R-e $ arccos(—1 + 5) < t < arccos(1 — rae we get 
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R-e R-e R 
V1t+f'@)dx = ——-dx 
ba —R+e VR? — x? 


arccos(1—) R 
= i a ee 
arccos(—1+%) V R?2 — R2 cos? t 


arccos(I—) 
=. / Rat 
a 


recos(—1+ %) 


= R (arecos (-1 + =) — arccos (1 _ =)) : 


Finally, since the arccos function arccos : [—1, 1] — [0, z] is continuous, we obtain 


R-e 


lim V1I+f'()2dx = lim R (arccos (-1 + =) — arccos (1 - =)) 
+e ae 


e>0 =R 


= R (arccos(—1) — arccos(1)) = Ra. 


Finally, since essentially the same argument holds for —f, we conclude (as 
expected) that the length of I equals 277R. 


In spite of the former example, the integral in (10.71) is generally difficult 
(or even impossible—cf. Problem 2) to compute exactly. Let’s see an illustrative 
example. 


Example 10.81 Given b > 0, let’s compute the length ¢ of the portion of the 
parabola y = x? situated between the points (0, 0) and (b, b’). 
With f(x) = x? in (10.71) we get 


b 
L= / V1 + 4x?dx. 
0 


In order to compute such an integral, we apply the trigonometric substitution x 
5 tant. Since 1 + tan?t = sec? t, 4 tant = sectand0O<x<bs0<t 
arctan(2b), it follows from the formula of integration by substitution that 


IA Il 


arctan(2b) arctan(2b) 
L= / V1 + tan? t- sec? tdt = i sec? tdt. 
0 


0 


Now, item (c) of Problem 15, page 412, gives 


1 
[s° tdt = zg isect: tant — log | sect — tan?¢|) + C. 
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If we set a = arctan(2b), then tana = 2b and, hence, seca = /1+ tara = 
1+ 4b2. Therefore, 


1 a 
= (5 (secr- tan r— log | sect —tant|) + c)| 
0 


1 
= bV1 + 4b? — 5 log(vl + 4b? — 2b). 


We now turn to a completely different topic, using Proposition 10.73 to introduce 
and discuss some of the amazing properties of the Gamma function. 
The Gamma function is the function I’ : (0, +00) — (0, +00) defined by 


+00 
TQ) = / er tae 
0 


Since 


+00 1 -+0o 
/ et! d= i er dtt+ / er! dt, 
0 0 1 


in order to see that I(x) is well defined it suffices to establish the convergence of 
both improper integrals at the right hand side above. 

To what is left to do, initially observe that, since x > 0, Problem 3, page 435, 
guarantees the convergence of fe f—! dt. Since 0 < et! < & | for0 < t< 1, 
Remark 10.74 (modified in the obvious way to be applied to the interval (0, 1]) 
assures the convergence of ie or a. 

On the other hand, for a fixed x > 0, we claim that er! < Ce~"/? fort > 1, 
where C is a positive constant that only depends on x. Indeed, letting n > x + 1 be 
a natural number, it follows from Theorem 10.54 that 


7’ 
lim —~ =0. 
t>+00 et/2 
Therefore, there exists C > O such that + < C for every t => 1. However, if this is 
so, then 


| ae as ee a (10.72) 


for every t => 1, as we wished to show. 

Finally, note that an obvious variation of Example 10.69 guarantees the conver- 
gence of the improper integral /, Hil e—'/? dt. Hence, inequality (10.72) allows us to 
apply Proposition 10.73 (with A = 1, f(t) = e'r—! and g(t) = Ce~"’”) to conclude 
that /; hls e't—! dt does converge. 

As a quick look through the classics [3] and [17] makes it clear, the Gamma 
function has many important applications in Mathematics, notably in Mathematical 
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Physics and Analytic Number Theory. Roughly, one can say that this is due to the 
fact that it extends the notion of factorial of natural numbers to the positive reals. 
In order to check the above claim, let’s start by noting that Example 10.69 gives 


+00 
Td) = / e'dt=1. 
0 
On the other hand, for x > 0 the integration by parts formula furnishes 


1 s 
P(x) = lim / ete dt+ lim | e'r'dt 


S>+00 Jy 
=1 41 
+ ~ | er ar) 
t=s X Js 
ms 1 fs 
+- : er ar) 
t=1 xX J] 


a i 1 rte 1 
=- / e't'dt+-— 1 et dt+ lim —e“‘s*, 
xX JO X Ji 


S>+00 X 


1 : 
= lim (26 rt 
soo \x 


(- a 
+ lim |-e'r 


s>+oo \x 


Now, according to Theorem 10.54, we have lims-++60 tess" = 0. Therefore, it 
follows from the above computations that 


1 1 1 +00 1 +00 1 
T(x) = - | e't dt + ~ | eras ~ | ef dt = -T(x+ 1). 
x JO xX JI X JO x 
Then, 
Ta@t+1)=x«TQ@) 
for every x > 0, and an easy induction starting from (1) = | gives [(n) = (n—1)! 
for everyn € N. 
We now show that I" is continuous and logl’ : (0,+0co) — R is convex in 


(0, +00). Actually, since 


r= eos 


in view of Proposition 9.68 it suffices to establish the second claim above. 
The convexity of log T amounts to showing that 


log (1 — s)x + sy) < 1 —s) log (x) + sloglQy), 


for all x,y > OandO<s < 1. Letting s = and 1—s = =, we have p,q > 0 such 


that : + ; = 1. Then, the fact that log is an increasing function that takes products 
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into sums shows that the validity of the above inequality is equivalent to that of 
ae me \/p 1/q 
Ty-+-)<T@”’ToO)”, (10.73) 
P 4 


for all x,y > 0 and all p,q > 0 such that > + 7 = 1. 
In order to establish (10.73), we first observe that 


+00 * +00 sa ge 
r (- + *) = i ett ta! at = i (eer eter) dt. 
P q 0 0 


c= yo! 
Therefore, letting f(t) = eer , g(t) = e/4t7 and applying Hélder’s 
inequality (cf. Problem 8, page 436), we get 


+00 +00 \/p +00 4 
r (- ‘ *) =| f@s(oar< ( / pay” ar) ( i e(o" at) 
Pp 4 0 0 7 


too 1/p +oo V/q 
= (/ er! ar) (/ ep! ar) 
0 0 


= T(x)” ro)4. 


Remark 10.82 One can actually show that T° is infinitely differentiable, and that 
the expression for its kK—th derivative can be computed by differentiating under the 
integral that defines I’. In particular, we must have 


+00 
I’Q@) = 7 et! logtdt. 
0 


The convergence of the improper integral at the right hand side above is the object of 
Problem 5, and the proof of the continuous differentiability of the Gamma function, 
together with the above formula for I’’(x), will be the object of Problems 18 and 19, 
page 471. 


Problems: Section 10.10 


1. For x9 4 0, compute the length of the portion of the catenary situated between 
the points of abscissas 0 and x9. 

2. Let € be an ellipse of major axis AA’ and minor axis BB’, with AA’ = 2a and 
BB’ = 2b. If we choose a cartesian system in which A’(a, 0) and B’(0, b), it is 
well known that € has equation x + xy = |. In this respect, do the following 
items: 
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(a) If f : [-a,a] > R is given by f(x) = Ne — x”, show that the graph of f 
is the portion of € situated in the upper halfplane. Then, conclude that the 
length of € is given by twice the value of the improper integral 


a b2x2 
1+ ————~-d. 
[ a; a (a = a) - 


(b) Let c = Va? — b? and x = acost, with 0 < t < z. Use the formula of 
integration by substitution to show that the length of € is given by 


u 2 
ab] y/1+(<) si cat. 10.74 
i (5) . an 


The definite integral of item (b) cannot be computed exactly. More generally, it 
is possible to show (albeit this is well beyond the scope of these notes) that the 
indefinite integral 


/ V1+xsin’ tdt, (10.75) 


where « is a positive real, cannot be explicitly computed in terms of elementary 
functions.'> Thanks to (10.74), one says that (10.75) is an elliptic function or a 
highly transcendental function. Such functions were brought into prominence 
by the German mathematician of the XIX century Carl Gustav Jacob Jacobi, and 
for this reason (10.75) is also known as Jacobi’s function. 

3. We are given in the plane a straight line r and a circle I of radius 1, such that r 
and I are tangent to each other. The cycloid generated by I (cf. Fig. 10.12) is 
the curve described by a point P on T as I rolls along r without sliding. In this 
respect, do the following items: 


(a) Fix a position of I’ and let O denote its point of tangency with r at this 
position. Then, choose a cartesian system xOy having r as horizontal axis 


Fig. 10.12 The cycloid generated by I 


BA proof of this fact can be found in [12]. 
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and such that T° is in the upper half-plane. Show that, after I" rolls in the 
positive direction (with respect to the horizontal axis) by f radians starting 
from O, this point will be at position P(x(f), y(t), with x(t) = t— sint and 
y(t) = 1 —cost. 

Note that function t + x(f), from [0,22] into itself, is a differentiable 
bijection with inverse x +> t(x) differentiable in (0,277). Therefore, if f : 
[0, 277] — R is the composite f(x) = y(t(x)), the graph of f is the portion 
of the cycloid situated in the vertical strip of the cartesian plane defined by 
the inequalities 0 < x < 27. Such a portion is called a step of the cycloid. 
Compute the length of it. 


(b 


wm 


4. It is possible to show that tes e° ds = at (see, for instance, Chap. 8 of 


[20]). Use this fact to compute (3) = Jz. 
5. Show that, for x > 0, the integral ie er! log tdt is absolutely convergent. 


6. Letk > 1 andnj,n2...,n, be naturals such that k | (ny +2 +---+ nx). Prove 
that 


ny +no+-:+ +k 
|-——_-— 1> Yn !ny!... ng. 


Then, if k is odd, conclude that 


(SE ):> vine, 


Chapter 11 
Series of Functions 


This last chapter presents the adequate notion of convergence for sequences and 
series of functions, with an emphasis on the study of power series. We start by 
examining a power series naturally attached to an infinitely differentiable function, 
called its Taylor series. Then, in Sects. 11.2 and 11.3 we discuss the basic results on 
uniformly convergent series of functions and on power series. Finally, in Sect. 11.4, 
we present some applications of the theory to sequences defined by linear recurrence 
relations. 


11.1 Taylor Series 


This section begins by introducing another important generalization of Lagrange’s 
MVT, namely, Taylor formula! with Lagrange remainder. In order to properly 
state it, recall (cf. Sect. 9.3) that a function f : [a,b] — R is n times continuously 
differentiable in {a, b] if f is n times differentiable in [a,b] and f™ : [a,b] > R is 
continuous. 


Theorem 11.1 (Taylor) Let I be an interval and f : I > R be a function n times 
differentiable in I. Given distinct xo, x € I, there exists c between xo and x such that 


nal ek) (n) 
fo) = CO) yay +f Oe xo)". (11.1) 


‘After Brook Taylor, English mathematician of the XVIII century. 
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Proof Firstly, assume that x9 < x and let g : [x9,x] > R be given by 


e) =f) - ~ O 


a 
— 1k _ — (x _ ty", 
nN: 


with a € R chosen in such a way that g(x9) = 0 (evidently, such a choice is always 
possible). 

Since f is n times differentiable in J and [xo, x] C J, several applications of the 
results of Problem 3, page 300, guarantee that g is differentiable in [x, x]. Since 
g(xo) = g(x) = 0, RGlle’s theorem assures the existence of c € (xo,x) such that 
g’(c) = 0. On the other hand, a simple computation furnishes 


a-fO@) 


—D c=)", 


g()= 


so that for c € (xo, x) we have g’(c) = 0 if and only if a = f (c). Hence, 


0 = g(%) =f@) - 2 (x= 0)", 


n—1 a 
Pt yt $0 
n 


as we wished to show. 

Now, let x < x9 andJ = x9 +x—J = {xo +x—-1; t € J}. Then J is an interval and 
I = x9+x—J, so that we can define h : J > Rby A(s) = f(%o+x—s). Corollary 9.42 
assures that h is n times differentiable in J, with h®(s) = (—1)‘f (xp + x — s) for 
0 < k <n. Hence, by applying (11.1) to h and to so < sin J, we obtain 


nal pw (n) 
is) = (5 — so) + (6 — sy 


k! 
k=0 


for some c € (so, 5). In turn, this gives 


Te (DY Co + x= $0) 64 EDT O , 
FO + x= 8) = YA (5 — 5) + (5 — 50) 
k=0 
for some c € (so, 5). Finally, letting so = x and s = Xo (so that so < s), we get (11.1) 
for x < Xo. oO 


If f : 7 > Ris atwice differentiable function, then, for distinct x9, x € J, Taylor’s 
formula with Lagrange remainder guarantees that 


ag 


F(x) = f (Xo) +f’ 0) (x — x0) + <= (& — x0)” (11.2) 


for some c between Xo and x. 
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Fig. 11.1 The trapezium rule 


In what follows, given a positive and twice differentiable function f : [a,b] > R, 
we use the above relation to estimate the difference between the area of the region 
R under the graph of f and that of the polygonal approximation of 7? by the union 
of the gray trapezoids of Fig. 11.1. For this reason, the coming result is known in 
mathematical literature as the trapezium rule. 


Example 11.2 Let f : [a,b] — R be a twice differentiable (but not necessarily 
positive) function. If {a = x9 < x1 < +++ < X, = b} is a uniform partition of 
interval [a, b], then 


(b—a)? 
<a sup lf". 


2 “1 b—a 
F(x) de— YFG) +f6-)(—*) 
# k=1 


Proof For 1 <k <n, let gx : [x~-1,x%] > R be given by 
ss 1 
8x(x) = f(dt — 77) + f (X41) — Xk-1). 
Xk-1 


Letting A denote the expression at the right hand side of the stated inequality, the 
additivity of the integral, together with the fact that x, — x,-) = ma forl <k <n, 
gives 


A = 


S ( [fever 500) + Fou -n.0)| 
k=1 \°%k-1 
(11.3) 


Yo ge) 
k=1 


where we used triangle inequality in the last passage above. 
Now, since f is twice differentiable, the same happens with g;. Moreover, 
&x(Xp-1) = O and easy computations furnish 


= > Igi(xe)I, 
k=l 


1 1 
8.(x) = f) — af ON ~ x1) — 5 FO) + FGr-1)) 
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and 
1 
8x) = — sf" Oe — Xk-1); 


so that 94 (x1) = gi (Xe-1) = 0. Hence, Taylor’s formula with Lagrange remainder 
guarantees the existence of cy, € (xg—1, x) such that 


84 (Ck) 


5) (x4 — xe-1)? 


Bk(Xe) = BK(Xe—1) + Be (XK—1) (Xk — XK—-1) + 
= Ff (eles = x41) (Xe — xe)”. 


However, since x, — X,-1 = ra. it follows from the above computations that 


II 


1 
lex(xx)| qc) lle = x41 (Xk — Xe-1)” 


Ua 


IA 


1 
qh ex) | |X~ — Xe li = 


Finally, by substituting this last expression in estimate (11.3) for A, we get 


“ 


n n b— 3 3 hn 
d < Sle s yeni = 2-9 rien 
k=1 k=1 


b—a)> b— 
< ( a) up lf"| = = = = up if’. 


= . . S 
4n3 [a,b] 


oO 


In words, the result of the previous example gives a numerical approximation, 
with controlled error, for the computation of an integral. As the discussion of the 
computation of the length of an ellipse shows (cf. Problem 2, page 443), sometimes 
this is the best one can get. In this respect, see also Problem 3 of this section. 

For more on the use of Calculus methods to get useful numerical approximations, 
see Chap. 15 of [2] or Part B of [7]. 

Back to the development of the theory, let f : J > R be infinitely differentiable. 
For xo € J, we say that 


ye 20)! 


! 
0 k! 


is the Taylor series of f at xo. 
The coming result gives a necessary and sufficient condition for the Taylor series 
of f at xp to converge to f in J. 
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Proposition 11.3 Let I be an interval and f : I > R be an infinitely differentiable 
function. If there exists a constant C = 0 such that |f™(x)| < C" for everyn > 1 
and every x € I, then, given xo € I, we have 


6s) = PE (e— a 


k>0 


for every x €1. 


Proof Fixed x € I andn € N, it follows from Taylor formula with Lagrange 
remainder that 


n—1 


(n) 
fy = Te = x0) +O 6 my (11.4) 


for some c between xp and x. Now, 


PO (C|x — xo|)” =n 


xo)"| S 0, 
n! 


where we used the result of Problema 3, page 218, in the last passage above. 
Therefore, it suffices to let n — +c00 in (11.4) to get the desired result. oO 
Example 11.4 Since | sin? x| = |sinx| < 1 and | sin@~) x| = |cosx| < 1, the 
previous proposition gives 


for every x € R. In Sect. 11.5, these series expansions will be the departure points 
for the rigorous construction of the sine and cosine functions. 


We now apply Taylor formula with Lagrange remainder to show that the Taylor 
series of the exponential function converges to such a function along the whole real 
line. 


Theorem 11.5 For x € R, one has 


v= a (11.5) 
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Proof Fora fixed x € R, it follows from Theorem 11.1 (with x9 = 0) that 
x __ x C x 
c=) te i (11.6) 


for some c between 0 and x. Now, using the fact that the exponential function is 
increasing, together with the result of Problem 3, page 218, we obtain 


cc 


eon 


[x|" 
Cc 


x x” 
=e Aan ae ell = 
n! n! 


— 0. 


n! 


Hence, the squeezing principle gives lim,—++00 e°: a = 0. Finally, letting n — +00 
in (11.6), we get the desired result. oO 


Note that the previous result largely generalizes Example 7.41. For future 
reference, we observe that changing x by ax in (11.5), we obtain 


ak 
ee a (11.7) 


k>0 


for every x € R. 

Sometimes, estimates of the error LOK — Xo)” in (11.1) as simple as those 
we did above do not suffice to guarantee that the Taylor series of an infinitely 
differentiable function converges to such a function at all points it could do. We 
now look at such an example. 


Example 11.6 Function x +> log(1 + x) is defined in the whole interval (—1, 1) 
and is infinitely differentiable there. Letting f(x) = log(1 + x) for |x| < 1, it’s 
immediate to verify that f(x) = cy for every k € N. Hence, f(0) = 
(—1)*1(k — 1)!, and it immediately follows that the Taylor series of log(1 + x) 
centered at 0 is given by 


—| k-1 
> ‘se lige (11.8) 
k 

kl 

Since ore < |x|* and x|* converges when |x| < 1, the comparison 
k k>1 8 Pp 

test for series assures that (11.8) converges whenever x € (—1, 1). Nevertheless, 
if we try to estimate the error in (11.1) as we did before, we will only be able to 
conclude that 


ye 
log +) = So! ? al 


k>1 
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forx € [-5, 1). Indeed, it follows from (11.1) that, for 0 < |x| < 1, we have 


SCD GD! 


log +x) = > xk + ——__.. x" 
i k n(1 +c) 


for some c between 0 and x. 
If0 < x < 1, thenO <c <x < | and we get 


oa 


x") <x" “5 0, 
n(1 +c)" 


If —1 <x <0, then —-1 <x<c<0,sothatl1+c>1+.x > 0. Therefore, 


_1y)yr-l n n n 
lee oa 
n(1 +c)?" n (l+c)?7~n (+x)? n\1l+x 


[x] 


ia 1s —} < x < 0; yet in this 


For —1 < x < 0, one easily concludes that 
interval, the last estimate above gives 


(-1)""! ss lon 
——— -x"|< -— 0. 
n(1 +c)" n 
On the other hand, for —1 < x < 3 we have te > 1, so that 


1 ( |x| ) n 
- — +0; 
n\l+x 


—1jn-l 
hence, in the interval ( —1, —5) we are not able to conclude that | C2 


n(1+c)" 


A 


-x"| > 0 


when n — +00. 


We shall remedy situations like that of the previous example in Sect. 11.3. For the 
time being, let us show that it may well happen that the Taylor series of an infinitely 
differentiable function f : R — R, centered at some x9 € R, does not converge to f 
in any open interval centered at xo. 


Example 11.7 Let f : R > R be given by 


e/* ifx>0 
FO= 16 ix <0 


Then, f is infinitely differentiable in R, with f“(0) = 0 for every k € Zy. In 
particular, the Taylor series of f centered at 0 vanishes identically. 
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Proof Evidently, lim, 9+ e—!/* = 0), so that f is continuous at 0, thus, in the whole 


real line. Also evidently, f is infinitely differentiable in (0,-+oo) and in (—oo, 0), 
with f(x) = Oif x < Oandne Zy. 
We claim that, for n € Z+, there exists a polynomial p,, such that 


f(®) = pal /ne (11.9) 


for every x > 0. Let’s check this by induction on n > 0, the case n = 0 being 
obvious. By induction hypothesis, assume that for some integer k > 0 there exists 
a polynomial p;, such that f(x) = p,(1/x)e7!/* for every x > 0. Then, the chain 
tule gives, for x > 0, 


, 1 1 1 i 
if" Gy = 2 (n: (<) —P; (-)) eT = pi (/ne*, 


with py+i(x) = x°(pe(x) — p(x), so that py+1 is again a polynomial. 

Now that we have (11.9), let’s make one more induction to prove that f is 
infinitely differentiable at 0, with f(0) = 0 for every n € Z 4. By induction 
hypothesis, assume that f is k times differentiable at 0, with f“ (0) = 0. Since f is 
infinitely differentiable in R \ {0}, in order to show that f is k+ 1 times differentiable 
at 0, with f“*+ (0) = 0, it suffices to show that f“+))(0) = 0. To this end, note that 
for h > 0 we have 


LOK) ~FOO _ pe /ne"" 
i a 


= l/h 1h 
i ge(1/hye 


where q;(x) = xpx(x), a polynomial; then, again by Theorem 10.54, we have 


— fO)-f©O)  . gk(/h) uly) 
lim ———~—— = lim —+— = lim =~ =0 
h>0+ h h>o+  el/h y>too @ 


On the other hand, for h < 0 we have lim;-+9— Pero = 0, since f® (h) = 
f(0) = 0. Therefore, f+") (0) exists and equals 0. Oo 


Problems: Section 11.1 


1. Let J be an open interval and f : J > R be convex and twice differentiable in /. 
If x) € J and r denotes the tangent to the graph of f at the point (xo, f(xo)), show 
that no point on the graph of f lies under r. 

2. Let J be an open interval, f : J — R be n times continuously differentiable and 
x € Ibe such that f’(xo) = f" (xo) = +++ = f(x) = 0. If n is even and 
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f (xo) > 0 (resp. f (xo) < 0), show that xo is a point of strict local minimum 
(resp. maximum) for f, i.e., 


x €1\ {xo} sufficiently close to x9 > f(x) > f(xo) (resp., f(x) <f(xo)). 


3. Let £(E) be the length of the ellipse € of major axis 2a, minor axis 2b and focal 
distance 2c. Show that, for every n € N, one has 


n—-1 
2b j 2 1)bx3 
ts ee 1+ ae sin (=) < ee 
n j=l n n 


Cc 


where k = 5. 
4. Obtain the Taylor series of the functions sinh and cosh. In each case, show that it 
converges to the corresponding function in the whole real line. 
5. Let J be an interval and f : J > R ben times differentiable and such that f“) is 
constant. Prove that f either vanishes identically or is a polynomial of degree at 
most n. More precisely, if x, xo € J, show that 


n (i) = 
(= any. 
j=0 


J! 


6. Use (11.6) to give another proof of Theorem 10.54. 

The next problem revisits the analysis of Newton’s method for numerical 
approximations of roots of differentiable functions (cf. Problem 9, page 300). 

7. Let f : [a,b] — R be continuous in [a, b] and twice continuously differentiable 
in (a, b), with f’,f” > 0 in (a,b). Assume that f(a) < 0 < f(b), and let a be the 
only root of f in the interval [a, b]. Do the following items: 

(a) If B € (a, b] and y = p — FE, show that a <y<f. 
(b) If (dn)no1 is such that ay € (01,5) and dp41 = an — Fe 
show that (a,),>1 converges and a, > a@ as n > +00. 

(c) Refine the analysis of item (b) in the following way: 


for every n > 1, 


(i) Show that there exists &, € (a, a,) such that f(a,) = f’ (an) (an — @) — 
sf" (En) (An =a). 

(ii) Conclude that 0 < a,4; —a@ = Fie) (dn — a)” < i (dy — a). 

(iii) Suppose that a < c < d < b satisfy d—c < 1 andf(c) < 0 < f(d). 
If A = maxpeq ra and we start with a; = d, use (ii) to successively 
conclude that 0 < ay,4;—a@ < A(a,—a)? and 0 < a,—a@ < A"(d—c)”" 
for everyn > 1. 


Yet with respect to the previous problem, note that Problem 10, page 219, 
guarantees that A"(d — c)?" — 0 as n — +00. Hence, the result of item (iii) 
above estimates the error with which a, approaches a. Note also that we can 
get results similar to those of the previous problem by assuming that f’ > 0 and 
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f” < Oin (a, b); it suffices to start with a, € [a, a). The same is true if we assume 
that f’ < Oandf” <0 (resp. f” > 0) in (a, b); it suffices to start with a; € (a, b] 
(resp. a; € [a,a)). 

8. The polynomial f(x) = x* — 2x — 5 is such that f(2) = —1 and f@) — 3. 
Therefore, the [VT guarantees the existence of a root a € (2, 3) for f. Apply the 
results of the previous problem to estimate a with five correct decimal places. 
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The material on this section is a prelude to the study of power series and extends, to 
sequences and series of functions, some concepts and results of Sects. 7.2 and 7.4. 
We start by defining the concept of limit of a sequence of functions. 

In all that follows, unless stated otherwise, J denotes an interval. 


Definition 11.8 For each n ¢€ N, let a function f, : 1 — R be given. If 
limy++00fn(x) exists for each x € J, we define the pointwise limit of the sequence 
of functions (f,,),>1 as the function f : J > R such that 


f(x) = Ey fr(X) (11.10) 


n 


for every x € J. In this case, we also say that (f,),>1 1s a pointwise convergent 
sequence of functions, or that (f,,),>1 converges pointwise to /. 


The pointwise limit of a sequence f, : / — R of functions, if exists, is unique. 
Indeed, if (f,)n>1 converges pointwise to f : J > R, it follows from (11.10) that 
for each x € I the sequence of real numbers (f,,(x)),>1 converges to f(x); hence, 
the uniqueness of the limit of a convergent sequence of reals (cf. Proposition 7.14) 
assures that there exists only one possible value for f(x). 


Example 11.9 We collect here two examples illustrating the fact that the pointwise 
limit of a pointwise convergent sequence of functions is not necessarily a well 
behaved function. 


(a) Foreachn €N, let f, : [0, 1] > R be such that f,(x) = x” for every x € [0, 1]. It 
follows from Example 7.12 that the pointwise limit of (f,),>1 exists and is the 
function f : [0, 1] > R such that 


_ §0, if x € (0,1) 
TOS) ey 


In particular, this example shows that the (pointwise) limit of a pointwise 
convergent sequence of continuous (actually, even infinitely differentiable) 
functions can be discontinuous. 
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(b) For eachn € N, let f, : [0,1] > R be given by f,(x) = nx(1 — x*)” for every 
x € [0, 1]. Since 0 < x < 1 > 0 < 1-2 < 1, Problem 4, page 218, assures 
that lim, 49. nx(1 —x*)" = Oif 0 < x < 1. Also, since f,(0) = f,(1) = 0 
for every n € N, we conclude that the pointwise limit of the sequence (f;,),>1 
exists and equals the function f : [0, 1] — R that vanishes identically in [0, 1]; 
in particular, f is integrable, with i. f(x)dx = 0. Now, note that the FTC gives 


x=1 n 


et De 1 


ie (x)dx = n= yt 


Hence, lim,—++00 fy fa(x)dx = limy++o0 Ce = 5, so that 


1 1 
lim : In(x)dx F / tim Tn(x)dx. 


n—>+oo 


We now introduce a notion of convergence for sequences of functions which is 
stronger than that of pointwise convergence. As we shall see right after the coming 
definition, under such more restrictive notion of convergence pathologies like those 
of the previous will not take place. 


Definition 11.10 A sequence (f,,),>1 of functions f, : J > R converges uniformly 
for a function f : J > R if the following condition is satisfied: given « > 0, there 
exists m9 € N such that 


n> no => lfid) —f@|<e, Vxel. (11.11) 


In words, a sequence (f;,),>1 of real functions defined in J converges uniformly 
tof : 1 > R if the choice of a sufficiently large index n (n = no, in the notations of 
the former definition) makes |f,(x) — f(x)| < € for every x € J. 


Example 11.11 Forn € N, let f, : R > R be given by f,(%) = (—)”. Then, 


1432 
(fn)n>1 Converges uniformly to the function that vanishes identically in R. 


Proof Since ae < ; for every x € R, we have |f,,(x) — 0| = |fa)| < x ae 


x € R. In view of this inequality and given « > 0, choose ng € N such that 5; < € 
for n > no. Then, for n > no we have |f,,(x) —0| < € for every x € R, as wished. O 


The coming corollary is an immediate consequence of Definition 11.10. 


Corollary 11.12 [fa sequence (f,)n>1 of functions f, : 1 > R converges uniformly 
tof : I > R, then (fn)n>1 converges pointwise to f. 


The concept of uniform convergence has the following geometric interpretation: 
since 


lin(x) —f@)| < € & f@)—€ < fn) <f@) +€, 
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Fig. 11.2 Geometric 
interpretation of uniform 
converge 


we conclude that (f,)n>1 converges uniformly to f : J — R if, given arbitrarily 
€ > 0, there exists no € N such that, for n > no, the graph of f, is contained in 
the strip of the plane of width 2€ centered in the graph of f (the gray region of 
Fig. 11.2). 

The actual importance of the concept of uniform convergence lies in the coming 
Convergence Theorems 11.13 and 11.14. In particular, comparing such results with 
the sequences of functions collected in Example 11.9, we conclude that the converse 
of Corollary 11.12 is not necessarily true. In other words, we conclude that there 
exist sequences of functions f,, : / > R such that (f,),>1 converges pointwise, but 
not uniformly, to some function f : / > R. 


Theorem 11.13 [fa sequence (f,)n>1 of continuous functions f, : 1 > R converges 
uniformly to f : I > R, then f is continuous. 


Proof Given xo € J andn € N, the triangle inequality gives 


If) —fo)l S FG) —frOO] + fn) — fn%o)| + Lino) — fo). 


Now, for a given € > 0, the uniform convergence of (f;,)n>1 to f guarantees the 
existence of no € N such that n > no = [fn(x) —f(x)| < § for every x € I. Hence, 
by writing the previous inequality for n = no we get 


€ € 
lf) —f@o)| < 2- 4 + [fo ®) — fino 0) = 2 + fro (0) = fray (X0)|- 
On the other hand, the continuity of f,, assures the existence of 6 > 0 such that 


xET, |x—xo| <5 => fig) — Sing X0)| < =: 


Therefore, for x € J such that |x — xo| < 5, we have 
€ 


= €, 
2 


€ € 
Lf (x) —f(%o)| = 2 + Ing (x) — fry (Xo) > 2 or 
so that f is continuous at xo. 
Finally, since x9 was chosen arbitrarily in 7, we conclude that f is continuous in J. 
oO 


11.2 Series of Functions 459 


Theorem 11.14 Let (f,,)n>1 be a sequence of continuous functions f, : [a,b] > R, 
converging uniformly to f : [a,b] > R. If gn, g : [a,b] > R are defined by 


ens) = f fulddt and gx) = f ftode 
then (8n)n>1 converges uniformly to g. In particular, 
b b 
i, f@dx = lim / Jn(x)dx. (11.12) 
a n>+00 Ja 


Proof For x € [a,b], it follows from the triangle inequality for integrals (cf. 
Proposition 10.14) that 


ent) — 200) = | [Galo —F0O)e 


< [i -Fola 
Now, given € > 0, the uniform convergence of (f,,)n>1 tof gives mg € N such that 


n> = lft) —f| < —— 
b-—a 


for every t € [a, b]. Hence, for n > no and x € [a, b], the above inequalities allow us 
to estimate 


len(@) — 20) <| pots oa s b-ae, 


For the second part, recall that uniform convergence implies pointwise conver- 
gence. Therefore, g,(b) > g(b) as n + +00, which is the same as (11.12). Oo 


Problem 5 extends the above result to the realm of integrable functions. Actually, 
the following important remark holds true. 


Remark 11.15 In the notations of the previous result, a much more general result 
is available. In order to state it properly, let 7 C R be an interval with endpoints 
a and f (in the sense of Sect. 10.9) and f,, : I — R be given integrable functions 
(in the improper sense, if J or f is unbounded), with (f,),+1 converging pointwise 
to an integrable function f : J — R. If there exists an integrable function F : J > 
[0, +00) such that 


lfn(x)| < FQ), Vxel, neN, (11.13) 


then it is possible to show that 


: dx= li , d. 
[ floyd = tim / fuldd. 
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This is the content of Lebesgue’s dominated convergence theorem (we abbre- 
viate DCT), whose proof is beyond the scope of these notes. For the interested 
reader, we refer to [20] or [27]. 

Also with respect to Lebesgue’s DCT, it is worth observing that, if J = [a, b], it 
suffices to assume that there exists L > 0 such that 


lf(x)| <L, Vxel, neN. 


On the other hand, item (b) of Example | 1.9 shows that such a condition is necessary 
for Lebesgue’s DCT to hold true. 

Finally, Problem 4 shows that proving (11.12) in a particular case and without 
the aid of Lebesgue’s DCT can be a somewhat difficult task. Problem 18 asks you 
to prove a very important consequence of Lebesgue’s DCT. 


We continue our study of uniform convergence by presenting the famous 
Weierstrass approximation theorem, that states that every continuous function 
defined in a closed and bounded interval can be uniformly approximated by a 
sequence of polynomials. 


Theorem 11.16 (Weierstrass) Every continuous function f : |a,b| — R is the 
uniform limit of a sequence of polynomial functions p, : [a,b] > R. 


Before we jump into the proof, it is worth to do some heuristics to motivate it. 
For n € N, the binomial theorem gives 


FQ) =f@O(x+ A -9)" =f DO (i)? (— xy 
k=0 


Letting a9 =a <a, <a). <...<a,=b, witha =a+ <(b—a), the uniform 
continuity of f guarantees that, for large n, the values f(x) for x € [az—, ax] do not 
differ too much from f(a,). Since every x € [a,b] belongs to one of the intervals 
[ax—1, ax], we expect that 


fa) = Sriao( {0 —xyr 
k=0 


for large n, and the right hand side is a polynomial function. This is precisely what 
we shall prove. 


Proof To make the forthcoming computations a little simpler, assume first that a = 
0 and b = 1, and let 


Pu(x) = yr(4) (‘#0 =a 
k=0 
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Since f is uniformly continuous in [0, 1], given « > 0 we can choose 6 > 0 
such that |f(x) — f(y)| < € whenever x,y € [0,1] satisfy |x — y| < 6. Letting 
M = max{(|f()|; t € [0, 1]}, we have for a fixed x € [0, 1] that 


| 3 (ro -7(-)) () va —y4 
> be-r(5) @u ge 


O<k<n 
|x—4|<6 
n 


+ © feo-r(2)| () fa 


0<k<n 
la 5 26 


< a ({)#a-9r 4 > a(t ay 


O<k<n O<k<n 


lx 4|<é6 jx £|>6 


IF) — Pn(x)| 


II 


IA 


In the last line above, the first summand does not exceed 


»-({)0 —x)* = E(x + el —x))" =e. 
k=0 


On the other hand, the difficulty in estimating the second summand lies in 
estimating how many integers 0 < k < n satisfy the inequality |x — A > 6. We 


; . : 2 ; 
overcome this by inserting the factor z (x - 9) > 1 under the & sign to get 


am > qu —xrk< = 3 (z- “) (jee — xy, 
oS is 
Letting 
" k\2[n sm 
s= 3 (4) (for 


we proved that 


If) — Pa@)| S€ + 5 (11.14) 


for every x € [0, 1]. 
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Substituting (x — ay: =x? — yy (£)’ we compute 
— 2 - n n—k “ k[n n—k 
S=x > (j}*0-9 -a $7} 40-9 
k=0 k=0 
“\ k(n) , n—k 
k=0 
= 25 n—-1 ey ok n—-1 (1 — xy" 
k-1 n\k-1 
k=1 k=1 
“k-1f{n-1 “1fn-1 
ee ee 2, _ yyn-k = _ yyn-k 
=x et (ane x) a(t x) 
-—1\Qk-1f[n-1 x 
=--~4(2 ea—ort 42 
+( n ae k-1 =) ba 


= x7 +4 (— “2 (elas =) ae - 


so that S < + on the interval [0, 1]. 

Therefore, back to (11.14) we get 

- M 
If) — pa@®)| < € + mse’ 

for every x € [0, 1]. Now, it suffices to see that this last expression is less than 2e, 
provided n > . 

For the general case, let f : [a,b] — R be a continuous function. Let g : [0, 1] > 
R be given by g(x) = f(a —x)a+ xb), so that g is also continuous. What we did so 
far guarantees the existence of a sequence (q,,),>1 of polynomial functions such that 
dn — g uniformly on 0, 1]. If pa(y) = gn(—), then p, is obviously a polynomial 
function and, for y € [a, b], 


0) — Pa)I = |p(—=) - an (X=*) |. 


a 


Thus, pn mu f uniformly on [a, 5]. Oo 
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Example 11.17 Let f : [a,b] > R be a continuously differentiable function. Prove 
that there exists a sequence (pn)n>1 Of real polynomials such that p, ail f and 
Di, —, f’ uniformly on [a, 5]. 


Proof By the Weierstrass approximation theorem, we can choose a sequence 
(Gn)n>1 Of polynomial functions such that q,, ay f’ uniformly on [a, b]. Letting 


peo / an(tdt + f(a) 


for x € [a, b], we get a polynomial function satisfying p,(a) = f(a) and p/, = qy for 
everyn EN. 
Hence, we can use the FTC again to write 


(O=n@= i; fi(pat— / an(t)dt = / (F'(0) — qu(t))at 


for every x € [a, b], so that 


x b 
FG) — pal] / LF’) — qul)|dt < i. IF (0) — ul) |dt. 


Since g, — f’ uniformly as n > +00, given « > 0 we can find mp € N such 
that [f’(1) — gn(t)| < € for every t € [a,b] and every n > no. Therefore, for n > no, 
we have 


b 
If) — Pa)| <|/ edt = €(b—a) 


for every x € [a, b]. Oo 
We now turn our attention to the study of series of functions. 


Definition 11.18 Given a sequence (f;,)n>1 of functions f, : 1 > IR, we define the 
series of functions Ie je as a shorthand for the sequence (s,)n>1 of functions 
Sn: 1 > R, such that s, = )°f_, fe for every n > 1. 


In the notations of the former definition, we say that }°,. , f; converges pointwise 
(resp. uniformly) in / tof : J + R if the sequence (s,,),>1 converges pointwise (resp. 
uniformly) to f. In this case, we write 


f= oh 


k>1 


and note that f(x) = 0,5 fe(x) for every x € I. 
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We shall generally apply Theorems 11.13 and 11.14 to uniformly convergent 
series of functions. In this sense, the coming corollary is of paramount importance 
for us. Note that, in words, its item (b) says that uniformly convergent series of 
continuous functions can be integrated termwise. 


Corollary 11.19 For each n > 1, let f, : [a,b] — R be a continuous function. If 
the series Deh converges uniformly to f : [a,b] > R, then: 


(a) f is continuous. 
(b) So Casi fde = Dest So fea. 


Proof (a) If s, = ans then s, is a finite sum of continuous functions, so that it 
is itself continuous. Since s, — f uniformly, Theorem 11.13 guarantees that f 
is a continuous function. 

(b) Since s, — f uniformly, Theorem 11.14, together with the additivity of the 
integral, gives 


bn 


b b 
; f(xdx = lim | Sy(x)dx = im | 2 ladder 


n b b 
= lim fi(x)dx = Sic(x)dx. 
slim def Hede= 3" fs 


k>1 


oO 


The former corollary will only be useful if we have an efficient way of finding 
out, in cases of interest, whether or not a given series of functions is uniformly 
convergent. The coming result, known in mathematical literature as Weierstrass M- 
test, provides a simple sufficient condition for the uniform convergence of a series 
of functions. Note that the “M” stands for majorization. 


Theorem 11.20 (Weierstrass M-Test) Let >°,.; fx be a series of functions defined 
in an interval I and satisfying the following conditions: 


(a) For each k = 1, there exists My > 0 such that |fx(x)| < M, for every x € 1. 
(b) The series Pps My converges. 


Then, the series of functions °,. | fe converges uniformly in I. In particular, if all of 
the f,.’s are continuous in I, then so is Y p54 fr 


Proof Given x € I, since [f(x)| < My for every n > 1 and )°,., My converges, 
the comparison test for series of real numbers (cf. Proposition 7.44) guarantees the 
absolute convergence of the series )°,. | f(x). Hence, we get a well defined function 
f : 1 — R such that 7 


f@) =f), Vxel. 


k>1 
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In order to establish the uniform convergence of )7,. , fy, let 5) = )-y—, fe. For 
x €lTandnéN, we have 


If) = Sn(x)| = 


ire 


k>n 


< fe) < 0 Mc. (11.15) 


k>n k>n 


Now, given € > 0, the convergence of the sequence ey Mk) +1 to ops Me 
assures the existence of np € N such that 7 


> <eée, Vn=>no. 
k=1 


k>1 


Hence, |}°,., Mi; | < € for every n > no. 
Back to (11.15) we conclude that 


IF) — n@)| < DOM <e 


k>n 


for n > no and every x € I. Therefore, (s,)n>1 converges uniformly to f, and this is 
the same as saying that )~ koi tk Converges uniformly to f. 
The last part follows at once from Theorem 11.13. oO 


Example 11.21 The Weierstrass M-test can be used to show that the series of 
functions >> k=l z sin(kx) converges uniformly in R. Indeed, on the one hand, since 
| sin(kx)| < 1 for every k > 1 andx € R, we get 


1 


for every k > 1 and x € R. On the other, just note that the series }°,. , a is 
convergent. 


Accidentally, the previous example also shows that, given a uniformly convergent 
series ),. fi of differentiable functions f, : 1 > R, the series }°,., ff of their 
derivatives is not necessarily convergent, even pointwise. Indeed, 


2 < (= sn()) = > : cos(kx), 


k>1 k>1 


which doesn’t converge pointwise in any real x of the form x = 2¢z, with £ € Z. 

We finish this section by using the material developed here to give an example of 
a continuous function which is not differentiable at any point. For what comes next, 
the reader might find it helpful to read the statement of Example 8.23 again. Up to 
details, we follow the discussion in Sect. 9.7 of [9]. 


466 11 Series of Functions 


Example 11.22 For y € R, let d(y) denote the distance from x to the nearest integer. 
If f : R > Ris given by 


fo) = (S) aao'v, 


k>1 


then f is well defined and continuous, albeit not differentiable at any x € R. 


Proof Example 8.23 guarantees the continuity of x +> d(10*x), for every k > 1. 


On the other hand, since 0 < (2.)'a(0kx) < u(ay and Vel (y converges, the 
Weierstrass M-test assures the well definiteness and continuity of f. 
Fix x > 0 (the case of x < 0 is analogous and will be left to the reader). We shall 


prove that f is not differentiable at x by constructing sequences (a,),>2 and (bn)n>2 


such that a, < x < b, and ay, by maa x, but 


F(bn) =f (Gn) | 1 


—_ 7 
by — An i 


This will contradict an obvious slight modification of Problem 8, page 300 (by 
allowing a, < x < by, instead of a, < x < by, there). 

Given x > 0 and an integer n > 2, let m, € N be such that m, < 10"x < 
m, + 1. If a, = 10-"m, and b, = 107"(m, + 1), we obviously have a, < x < by, 
and b, — a, = 10-". If k > nis also integer, then 10*a,,10*b, € N, so that 
|d(10‘b,,) _ d(10*a,)| = 0; on the other hand, if k < n, then Example 8.23 gives 


|d(10*,) — d(10*a,)| < 10. (11.16) 
Therefore, 
n-1 9 \k 
F(x) —F(an) = Y> (=) (4 A0%n) — a(104a,)). (11.17) 
k=0 
Since 10""!a, = a“ and 10h, = a by separately considering the cases 


0 <x < land x > 1 one easily sees that |d(10~',, - d(10~'a,)| = a: Therefore, 
the triangle inequality in (11.17), together with (11.16), gives 
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roto) — 1X Gg) t's acorn) 
k=0 


n—2 


=e (=) |ac1o%,) — d(10*a,)| 


fon) — f(an)| 


IV 


grl 
10" 


gr-l 9 \k Sy 1 
Te ag) = age 


= k=0 


IV 


IV 


ee ee ae 


er es 


Thus, 


f (bn) —f (an) Ree alla 


+00. 
bn — An > 8 


Problems: Section 11.2 


1. Prove that the series of functions Dei Ge) converges uniformly in R and 
compute its sum. 


2. Prove that the Taylor series of the functions e*, sinx and cosx converge 
uniformly to such functions in each interval [—a, a], with a > 0. 

3. Forn EN, let f, : [0,1] — R be given by f,(~) = nxe~™. Prove that (f,)n>1 
converges pointwise, albeit not uniformly, 0. Also, show (without invoking 
Lebesgue’s DCT) that Jo fal) dx > O0asn— +00. 

4. For n > 0 integer, let f, : [0,1] — R be given by f,(x) = x"e™™*. Do the 
following items: 


(a) Show that (f,)n+0 converges pointwise to f : [0,1] > R, with f@) = 
0, ifx<1 
e, ifx=1° 
(b) Show that, for each integer n > 0, there exist natural numbers a, and b, 
such that fy fr(x) dx = dn — bne7!, with ag = bo = 1, dy = nay— and 
b, = nb,_-; + 1 for everyn € N. 
(c) Conclude that a, = n! and b, = n! 7-9 a for everyn EN. 


(d) Use the result of the previous items to show that ie Jn(x) dx — 0 as 
n— +00. 
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. Letf, : [a, b] > R be a sequence of integrable functions, converging uniformly 


to a function f : [a,b] > R. Show that f is integrable and 


b b 
i. flax = tim / A@ar. 


. Let (fr)n>1 be a sequence of differentiable functions f, : [a,b] — R, such that 


fi. (x)| < M, for every n € N and x € [a,b]. If }°,., My is a convergent series 
and there exists x9 € [a,b] such that >... f:(xo) converges absolutely, show 
that the series of functions )7,. | fi converges uniformly in the interval [a, b] to 
a differentiable function, such that 


d / 
x df) = DK) 


k>1 k>1 


for every x € [a, D]. 


. (Putnam) Letf : [a,b] > R be a continuous function such that /’ : f(x)x*dx = 0 


for every k € Z. Prove that f vanishes identically. 


. (Berkeley) Does there exist a continuous function f : [0,1] — R such that 


Jo fa)xdx = | and i. f(x)x*dx = 0 for every nonnegative integer k 4 0? 
Justify your answer. 


. (Berkeley) Let gy, : [0,1] — R be a sequence of nonnegative continuous 


functions such that 
1 


lim x pn (x)dx 
n—>+o0 Jo 


exists, for each k € Z4. Prove that, for any given continuous function f : 


[0, 1] > R, the limit 


lim / F(X) Gn(x)dx 


n—>+oo 


also exists. 


. With respect to the series of functions )°,., i sin(kx), do the following items: 


(a) Use Abel’s criterion (cf. Problem 18, page 243), together with the discus- 
sion of items (b) and (c) of Problem 20, page 244, to show that the given 
series converges pointwise in the interval (0, 277). 

(b) Revisit the proof of Abel’s criterion, as sketched in the hints given to 
Problem 18, page 243, to show that Abel’s identity guarantees that the 
convergence of item (a) is uniform in every interval of the form [5, 27 — 6], 
forO<é< Zz. 
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11. Let f : R — R be a piecewise continuous function, periodic with period 27. 
The Fourier series’ of f is the series of functions 


ao(f) 
2 


+ S> (ac(f) cos(kx) + be (f) sin(kx)) 


k>1 


where 


Tu 


a(f) = = . f(x) cos(kx)dx and b,(f) = = 7 f(x) sin(kx)dx 


for all k. In this respect, do the following items: 


(a) Iff is continuously differentiable and k > 1, prove that 


af) =—Thi(f') and ba(f) = Taf). 


Then, conclude that az,(f),b.(f) — 0 as k — +00 (this is a special 
case of Riemann-Lebesgue’s lemma, to be proved in greater generality in 
Problem 15). 

(b) Iff is twice continuously differentiable and k > 1, prove that 


1 1 
ar(f) = eal) and bi (f) = — Bef”): 


Then, conclude that the Fourier series of f converges uniformly in R and, 
hence, defines a continuous function in R. 

(c) Yet assuming f to be twice continuously differentiable, let g : R — R be 
the continuous function given by 


20) = 2D £ T Cax(f) cos(kx) + by(f) sin(ha)). 


2 
k>1 


Prove that a;(g) = a;(f) for 1 > 0 and b,(g) = b)(f) for! > 1. 


For the next two problems, we say that a function f : J > R is piecewise 
continuously differentiable if there exists reals a; < az < ... < a, in J such 
that f is continuously differentiable in all of the intervals (—oo, a,| OJ, [a;, aj+1] 
for 1 < i < nand [a,, +00) NJ. We shall assume without proof the validity of 
Fourier’s convergence theorem,’ which states that if f : R — R is piecewise 


Fourier’s studies on heat conduction, collected in his famous book Théorie Analytique de la 
Chaleur, laid the groundwork for modern Mathematical Physics, which in turn has greatly 
influenced the development of the theory of Partial Differential Equations. 


3In spite of being known by the name of Fourier, such a result is actually due to Dirichlet. 
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continuous, piecewise continuously differentiable and periodic of period 27, 
then its Fourier series is pointwise convergent in the whole real line, converging 
at xo € Rto 


5(,lim_ FO) + im, F009), 


For a proof of Fourier’s convergence theorem, together with a discussion of 
several other interesting properties of Fourier series, see [10, 20] or [24]. 

12. Let f : R > R be periodic of period 27 and given by f(x) = x’ in the interval 
[—z, 7]. 


(a) Compute its Fourier series. 


2 
IU 


(b) Use Fourier’s theorem to show that )°,., z =. 


13. Let f : R — R be periodic of period 27 and such that 


0, if -~7<x<0 
fmM= 41, if0<x<az 


(a) Compute its Fourier series. 
(b) Use Fourier’s theorem to deduce Leibniz formula‘ for z:: 


1 1 1 


14. (Miklés-Schweitzer—adapted) Let p > 0 be a real number and f,g : R— R 
be continuous functions, with g being periodic of period p. 


(a) Prove that 


¥ foul (Beamer = ([" ora) 2 3¢(2) 


k=1 


= =( reas) ([ eoray), 


(b) Prove the following theorem of Fejér’: 


lim [ reoscnas = ~([revas)( f° gx)ds). 


n>+ 


4We shall give a self-contained proof of such a formula in Problem 9, page 484. 
> After Lipot Fejér, Hungarian mathematician of the XX century. 
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15. Prove the Riemann-Lebesgue lemma: let f : R — R be a continuous function, 
periodic with period 27. If a,(f) and b;(f) denote the Fourier coefficients of f, 
as defined in Problem 11, prove that a; (f), by(f) > 0 as k > +00. 

For the next problem, let f : R — R be continuous and periodic with period 
2n.Forn € Z4, let 


Sif (x) = > + > (accos(kx) + by sin(kx)) 
k=1 


denote the n-th partial sum of the Fourier series of f and 


Sof (x) + Sif) + +++ + Sif) 


Onf (x) = Pen 


Another theorem of Fejér states that o,f =a f uniformly on R. 

16. Let f,g : IR — R be continuous and periodic with period 27. If a,(f) = a,(g) 
and b;(f) = b;(g) for every k, prove that f = g. 

17. (Berkeley) Find all continuous functions f : R — R such that f(x) = f(«+1) = 
f(x + V2) for every x € R. 

18. * This problem establishes a fairly general version of Leibniz’ rule of 
differentiation under the integral sign, assuming the validity of Lebesgue’s 
DCT. To this end, let 7, / C R be intervals and f : J x J > R be continuous 
in each variable separately.® Let a and B be the endpoints of J (in the sense of 
Sect. 10.9). 


(a) Assume that for every x9 € J and every sequence (x,,),>; in / converging to 
Xo, there exists an integrable function g : J > R such that |f(@,,)| < g() 
for every n > 1 and every t € J. Then, t + f(x, f) is integrable for each 
x € I. Moreover, if F : J > R is given by 


B 
F(x) = / flr tdt, 


then F is continuous. 

Let the function x +> f(x, f) be continuously differentiable for each t € J, 
and let a denote its derivative. Assume that for every x9 € J and every 
sequence (x,),>1 in J converging to xo, there exist integrable functions g : 


J — Rand G: J > R such that |f(%,0| < g(t) and E(%n.0| < G(t) 
for every n > 1 and every t € J. Then, t b f(x,t) andt b # G, t) 


are integrable for each x € J and F : I — R, defined as in item (a), is 
continuously differentiable in J, with 


(b 


wm 


B 
Fe = | F.1d (11.18) 


®T.e., such that both t > f(x, f) and x > f(x, t) are continuous functions. 
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19. * Show that the Gamma function is continuously differentiable, with 


+00 
(a= / er logtdt. 
0 


11.3. Power Series 


In Sect. 11.1 we saw several examples of infinitely differentiable functions defined 
in open intervals and which coincide with their Taylor series in such intervals. The 
Taylor series of an infinitely differentiable function is a particular case of a power 
series, i.e., of a series of functions of the form 


So ax(x — x0), (11.19) 
k>0 
where ao, a1, do, ... are given real numbers. In this case, as it happens with number 


series, we say that a,(x — x9)” is the n-th term of the power series and that a, is the 
n-th coefficient of it. 

In this section we develop the basic aspects of the theory of power series, starting 
with the following central result. 


Theorem 11.23 Given a power series ae a(x — xo)‘, there exists 0 < R < +00 
such that the series: 


(a) Converges absolutely in the interval (x9 — R,X9 + R) and diverges in 
R \ [xo — R,x0 + R]. 
(b) Converges uniformly in the interval |x9 —r,x) +r], VO<r<R. 


Proof Firstly, note that the series )°,., ax(x — xo)* converges absolutely in the 
interval (xo — R, x9 + R) if and only if the series 3 k>0 apxt converges absolutely 
in the interval (—R,R). Accordingly, )°,.9 ax(x — xo)* converges uniformly in 
[xo — r,xo + r] if and only if >). ax" converges uniformly in [—r, r]. Therefore, 
we can assume that xo = 0. 

Let’s first deal with item (a). 


Claim 4 if °,.5 a,x* converges at x = a # 0, then it converges absolutely at any 
x € (-a,q@). 
Indeed, for such an x, we have 


ky k al 11.20 
Dla] = Do lavet| |=)". (11.20) 


k>0 k>0 


Since 3 9 a,o* converges, Proposition 7.36 guarantees the existence of no € N 


such that n > no => |a,c"| < 1; therefore, it follows from (11.20) that |a,x*| [a < 
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xk ; : : x|k: ¥ 
|| for k > no. Since the geometric series > en || is convergent (for, =| < 1), 
the comparison test assures that the same holds true for )* — |a,x*| and, thus, for 


Vico |ax*|. 


Claim 5 if )>,.) a,x* diverges at x = B # 0, then it also diverges at any x € R 
such that |x| > ||. 
For such an x, if )°,. a,x* converged, the previous claim would assure the 


(absolute) convergence of the series }°,. 5 ax B*, which is an absurd. 


Claims 4 and 5 guarantee that, with respect to )7,. a,x*, one of the three 
following possibilities does happen: (i) it converges only at x = 0; (ii) it is absolutely 
convergent at any x € R; (iii) there exist a, 8B # 0 such that |w| < |6| and the series 
converges absolutely when |x| < @ and diverges when |x| > ||. 

If either (i) or (ii) happens, there is nothing left to do. If (iii) happens, let 


R = sup{r > 0; ss au’ converges absolutely when |u| < r}. 
k>0 


We claim that ae a,x* converges absolutely when |x| < R and diverges when 
|x| > R. To this end, let’s look at two separate cases: 


(i) If |x| < R, take r such that |x| < r < Rand >>). au‘ converges absolutely 
when |u| < r. Then, in particular )>,. 9 ax is absolutely convergent. 

(ii) If |x| > Rand >>... a,x" converged, then, taking R satisfying R < R < |x, 
it would follow from Claim 4 that a, axu* converges absolutely when 
|u| < R. But this obviously contradicts the definition of R. Therefore, >> ,.5 ax 
diverges. 


For item (b), given 0 < r < R, it follows from what we did above that )*,.., |ax7*| 
converges. Hence, letting My = |axr*|, we have, for |x| < r, that |ayx“| < |agr*| = 
M,. Therefore, Weierstrass M-test assures that ae ayx* converges uniformly in 
the interval [—r, r]. 7 Oo 


In the notations of the previous result, we say that 0 < R < +00 is the radius 
of convergence of the power series (11.19), and (x) — R, xo + R) is its interval of 
convergence. 

Problems 10 and 11 give a general formula for the radius of convergence in 
terms of the coefficients of a given power series. For the time being, the following 
consequence of the previous result allows us to easily compute it in a number of 
interesting examples. At this point, we suggest that the reader runs through the proof 
of the ratio test (cf. Proposition 7.51) once more, just to note that it remains valid if 
(in the corresponding notations) / = +-oo. 


Corollary 11.24 Let (ay)n>0 be a sequence of nonzero real numbers. If there exists 


0 < R < +00 such that limn-++00 ae | = R, then the power series Y°,.) ax(x — 


xo)* has radius of convergence equal to R. 
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Proof Since 


Qnt+1 (x ~~ xo)"t! 


n(x — Xo)" 


_ |x — Xo| n |x — Xo| 
7 R 


’ 


an 


an+1 


lx=xol pal 


the ratio test assures that }°,.9 ax(x — xo) converges absolutely if < 1 and 


diverges if = al > 1. Since wl < 1 if and only if |x — xo| < R, ie previous 
result siaraiitees that R is precisely the radius of convergence of the given power 


series. oO 


Example 11.25 


(a) The power series }°,., 7x“ has radius of convergence 1, for | 7 - 7] 5| = mh, 
lasn — +00. 

(b) The series Des i k\x* has radius of convergence 0, for nl = aan > 0. 

(c) The series ee aes = gy has radius of convergence +00, for en | = 


n+1— +00 asn — +00. Actually, we already know from the material of 
Sect. 11.1 that SS a — 2) = e** for every real x. 

(d) The former corollary does not apply to the power series }~ kel ae for it has 
infinitely many vanishing coefficients. See, however, Problem 2. 

(e) Given a # 0, the previous corollary assures that the power series Yiealon)* 


a” 


has radius of convergence fu for | ar 


— tat for every n > 0. By 


Proposition 7.38, for |x| < — we have 


Deni = 


k>0 


(11.21) 


l-ax 


We now collect another important consequence of Theorem 11.23, which will be 
of crucial importance for the proof of the subsequent result. 


Proposition 11.26 [f the power series ))j59 a(x — xo)* has radius of convergence 
R > 0, then: 


(a) The function f : (x) — R,x9 + R) > R, given by f(x) = are a(x — xo)*, is 
continuous. 

(b) For every x € (xy — R, xo + R), we have Jc,F Odt = Leap xo)Kt 1. In 
particular, the radius of convergence of the series at the right hand side above 
is greater than or equal to R. 


Proof Let fy, : (xo —R, x0 +R) — R be given by fy(x) = ax(x—xo)*. Theorem 11.23 
givesf = 7.1 fx in o—R, x0 +R), the convergence being uniform in [xo—r, xo +7], 
for every 0 < r < R. Since f; is continuous for every k, items (a) and (b) follow 
immediately from items (a) and (b) of Corollary 11.19. oO 


The coming theorem is the central result in the theory of power series. 
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Theorem 11.27 If the power series Yeo a(x — xo)* has radius of convergence 
R > 0, then: 


(a) The function f : (xo — R,xXo + R) > R, given by f(x) = Voy aK — xo)*, is 
infinitely differentiable. 

(b) For everyn € N, we have f(x) = ss Toa - a(x — x9) for every x € 
(xo — R,X0 + R), and the series defining f” also has radius of convergence 
equal to R. 


Proof As in the proof of Theorem 11.23 we can assume, without any loss of gen- 
erality, that x9 = 0. We shall first prove that, letting R be the radius of convergence 
of the power series }>,. , kagx*—!, then R = R and f'(x) = 7,5, kaxx*“! for every 
x € (—R,R). ~ ~ 

Note first that |ka,x*| > |azx*| for every integer k > 1 and every x € R. 
Therefore, if the power series }°,., ka,x*— converges absolutely at some x, then the 
same holds true for the power series }*,. , kaxx* and, thus, for )°,.) a.x*. Hence, 
R<R. 

Now, for 0 < x < RandO < h < R—x, we have 


faxth)—-f(x) a1 Sante +h) — Saget _ ya(HP—). 


h h h 
k>1 k>1 k>1 


On the other hand, Lagrange’s MVT gives, for each k > 1, some cy € (x,x + h) 
such that eis = kck—!. Therefore, 


fat ” f(x) o Yo kaye, 
k>1 

so that the power series pao ka,ck! converges absolutely. Since |ka,x*!| < 
|kayck—! |, the comparison test guarantees that 7.) ka,x'—! also converges abso- 
lutely. Analogously, such a series converges absolutely if —R < x < 0, so that 
R>R. 

Our discussion up to this point assures that g : (—R, R) — R, given by g(x) = 
ae ka,x*—", is a well defined function. By item (b) of the previous proposition, 
we have 


[ soa = > ty a yas = f(x) — ao. 


9 ke 1 k>1 


Hence, the FTC gives, for |x| < R, 


f°) = ge) = Dl kat. 


k>0 
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For what is left to do assume, by induction hypothesis, that we have already 
shown f to be m times differentiable in (—R, R), with f™ (x) = Yoisn oa -ar(x— 
xo)*—™; assume also that this last power series has radius of convergence equal to 
R. Then, by the first part, f” is a differentiable function, the power series defining 

/ . 
(f (m)) =f" has radius of convergence R and 


fray = Do (k= m)ay(x = x9)" 
k>m+1 (km)! 
k) 
= > —$<—— a (¢ = xp) rt, 
= ! 
k>m+1 (k (m e 1)! 
This completes the inductive argument and, thus, the proof of (b). Oo 


In the notations of item (b) of the previous result, we observe that the power 
series expansion of f is obtained by termwise differentiating, n times, the power 
series expansion of f. Indeed, letting & denote the n-th derivative of a function (if 
it exists), an immediate computation gives 


n 


dx” 


(x —x9)* = 


We now collect two useful consequences of the previous theorem, the first of 
which refines the analysis of item (b) of Proposition 11.26. 


Corollary 11.28 If the power series ae, a(x — xo)" has radius of convergence 
R > 0, then the radius of convergence of the integrated power series Veo pit - 
xo)*t! is also equal to R. 


Proof Since > ps9 a(x — xo)* = a pry es coe xo)*t!, item (b) of the previous 
theorem guarantees that both of the given series have the same radius of conver- 
gence. oO 


Corollary 11.29 Assume that the power series ae. a(x — xo)" has radius of 
convergence R > 0, and let f : (x9 — R,xo + R) > R be given by f(x) = 
eo ay(x — x0)*. Then: 


(a) an = fro) for every n > 0. 
(b) Veo ay(x — xo)* is the Taylor series of f. 


Proof Item (b) follows immediately from (a). For (a), item (b) of the previous result 
i (n) LM, ty — fo) 
gives f (xo) = Gant an = Ndn; therefore, a, = >. oO 
We shall now see that the results presented so far allow us to reobtain, by means 
of a unified approach, some of the Taylor expansions discussed in Sect. 11.1. In this 
respect, see also Problem 3. 
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Example 11.30 Item (e) of Example 11.25 assures that > = )oyso(—D)*, with 
radius of convergence equal to 1. Hence, item (b) of Proposition 11.26 gives, for 
|x| < 1, 


x 


= 1 k els 
log(1 +x) = A re KG 1)'# dt = a oe : (1129) 


Example 11.31 Note that 


1 
14+x 


= ag da a2? base 


for |x| < 1. Therefore, again by item (b) of Proposition 11.26, we have, for |x| < 1, 


tanx = I‘P* dt 
arctan x [me =e yt 


eB xP x! 


== 4 a pas, 11.23 
x ae “aa ( ) 


Problems 8 and 9 will show that (11.22) and (11.23) remain true for x = 1. 


Example 11.32 It follows easily from Corollary 11.24 that me pone series 
eo ax converges in the whole real line. If f(x) = ae nx forx € R, 
then item (b) of Theorem 11.27 furnishes 


1 1, 
f@= > kor = pa ae = f(x). 
k>1 k>0 


Therefore, 


£(EF() = FO) +F') = 0. 
IX 


so that x +> e*f(x) is a constant function. Finally, since f(0) = 1, we gete“f(x) = 
1 or, which is the same, f(x) = e*. 


We now apply Theorem 11.27 to show that, for any a € R \ {0} and |x| < 1, one 
can write (1 + x)® as a convergent power series. To this end, given aw € R and an 
integer n > 0, we define the generalized binomial number (‘) by letting ($) = 1 
and, forn > 1, 


(11.24) 


n n! 


(") _ e@-1@-2)...@-n+1) 
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The following lemma establishes some useful properties of generalized binomial 
numbers. As in Sect. 4.2, the property of item (a) is also known as Stifel’s relation. 


Lemma 11.33 Givena € Randn€N, we have: 
(a) (= (7) +0): 

(b) 5 (\) (“—}) for every a 0. 

(c) |(4)| <1 when |a| < 1. 


II 


Proof Item (a) is an easy computation: 


()-(") 


DDO ret) 
n! 


ee) ee 
n! 


“(a —1)(a~2)...(a—n-+ 1)(@ — (a —n)) 


1 
“Gem rele ee 


=) 


Item (b) follows immediately from (11.24). 
Finally, for item (c), if |a| < 1 then (11.24) and the triangle inequality give 


())- lal (lal + (lal +2)... el +m=1) V2 
n 


n! n! 
Oo 


The coming result is known in mathematical literature as the binomial series 
theorem, or simply as the binomial theorem, and is due to Newton. Notice 
that (11.25) generalizes (4.11), for ({) = Oifa ¢ Nandk > a. 


Theorem 11.34 (Newton) Fora 4 0 and |x| < 1, we have 


d+)7=>> (") (11.25) 


n+1 n 1 
= — il, 
la —n 
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Corollary 11.24 assures that eco Ce has radius of convergence equal to 1. 
Hence, by Theorem 11.27, the function f : (—1,1) — R given by f(x) = 
Y ke0 (*)x* is differentiable. Also from that result, 


f@= Dai} =~ (fe (11.26) 
k>1 k>1 


where we used item (c) of the previous lemma in the last equality above. Therefore, 


(1 +xf') =a +x) 0 (i) 


k>1 


= af (x). 
If g(x) = (1 + x) “f (x), then we have for |x| < 1 that 


¢@) =e +9)" @) + O43) 7 @) 
= (14x)! (-af (x) + (1+ )f'@)) = 0. 


Therefore, g is constant in (—1, 1). Since g(0) = 1, we get (1 + x) “f(x) = 1 for 
|x| < 1, as wished. 

For the general case, suppose that (1 + x)* = > ko (()x* for some a 4 0 and 
every x € (—1, 1). Let us show that similar formulas hold true for wa — 1 and a + 1 
(and every x € (—1, 1)): 


(a) Theorem 11.27, together with item (b) of Lemma 11.33, gives, for |x| < 1, 


(l+x)7! 


| 
a 
IV 
a 
> R 
| 
_—_ 
een 
| 
ll 
es 
mM 
a 
R 
x | 
= 
a 
> 
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(b) Item (a) of lemma 11.33 gives us 


d+) =+y>> (")« oe (")« +0 (‘Je 


k>0 


Ee) PE (t 


ay. ( st 


k>0 


By induction, we conclude that (11.25) is true for all aw 4 0 and |x| < 1. Oo 
Corollary 11.35 Given a, B 4 0, we have 
(1+ py? = >> {" (pn! 
k 
k>0 
for every x € R such that |x| < TE 
Proof It suffices to apply (11.25) with Bx instead of x, noticing that |Bx| < 1<¢ 


1 


Example 11.36 As an application of the corollary above, note that for |x| < ; 
we have 


(I-27? = > (2) (—2x)*, 


k>0 
with 
=1/2 1/1 i 
( n =a G-) (-3-"+1) 
ely 13 5 2n—1 
ae 
(-1)" (2n)! 


n 


(-1)" (2n)! _ (-1)" f2n 
n! 2”-2"n! 4” 


Hence, for |x| < 5 we have 


sarap (ae 
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We finish this section by presenting a result on the extension of a power series to 
the endpoints of its interval of convergence. To this end, we first need to discuss a 
lemma which is interesting in itself. 


Lemma 11.37 Let f(x) = Veo a,x* be defined in the interval (—R, R), with a, > 
0 for every k = 0. Lf 0 a,R* converges, then 


Jim f@) = So aR". 


k>0 


Proof Since a, > 0 for each k > 0, we have, for0 < x < R, 


Do ak — F(a) = DF a(R — x4) > 0. 


k>0 k>0 

On the other hand, 
> a,R* — f(x) = = a,(R* — x*) + > a(R” =x") 
k>0 =0 k>n 


where we used again the fact that a, > 0 to obtain the above inequality. 

Now, given « > 0, the convergence of )°,.5 a,R* assures the existence of 
no € N such that i a,R’ < e€ for n > no. Fix such ann > no. Since 
lim, r— fo ax(R* — x*) = 0, there exists § > 0 such that 


xe(R-6,R)> S > a(R — x‘) <e€. 


k=0 


With such choices and in view of the above computations, we conclude that 


x € (R—8,R) > 0< >) aR‘ — f(x) < 2¢. 
k=0 


Oo 


Remark 11.38 In the notations of the statement of the previous lemma, Problem 7 
shows that the assumption on the sign of the a;’s can be dropped. 


It is somewhat surprising that the converse of the previous lemma also holds true. 
Such a result is known in mathematical literature as a tauberian theorem after the 
work of Alfred Tauber, Austrian mathematician of the XIX and XX centuries. For 
the sake of simplifying the notation, we assume that R = 1, leaving to the reader 
the (easy!) task of dealing with the general case. 
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Theorem 11.39 Ler f(x) = doiso a,x* in the interval (—1,1), with a, > 0 for 
every k = 0. Iflim,_,\— f(x) does exist, then )°)5. a; converges and lim,-,\— f(x) = 


Veo ag. 


Proof The last part follows from the previous lemma. For what is left to do, note 
that, forO0 < x < 1, 


girl] 


xf" (x) = SY kayx* = > S kayx* 


ke 1 j20 k=2) 


git+l_y 


; yi tl 
= ) ) Bax 


J20 k=2/ 


gti} 


| > aie, 


j20 k=2) 


In particular, for a fixed natural number / and 0 < x < 1, we have 


7 l grey} 
i +1 
/ if’ (dt > Y 2! > a) f pom 
j=0 k=2) 
l att} ait! 
= yj 
= pee: ( ak Qi+1 
jJ=0 k=2) 
1 1 itty 
ee git 
ae) a Js 
J=0  k=2) 


Now, let a = lim,_,;— f(x). Since f is increasing and nonnegative, we get 


ie tf’ (dt = xf (x) -— [rou <a. 
0 0 


Taking these two estimates together, we arrive at 


1 ty 

j+1 
) ( ) ax) <2 
j=0 k=2) 
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Finally, since the sequence of partial sums of the series }°,.., ax is nondecreas- 
ing, this last inequality gives 


ba ag < do + 2a. 
k>0 
Oo 


Example 11.40 By applying the formula of Problem 11 for the radius of conver- 
gence, we conclude that (—1, 1) is the interval of convergence of the power series 
> i20 x’, Alternatively, we can observe that, for |x| < 1, 


0< Se < > |x|* < too. 


20 k>1 


On the other hand, since the sequence of the coefficients of this series clearly 
diverges, the tauberian theorem assures that 


For a more refined estimate on the growth of wees as x — Il-, see 
Problem 12. 


Problems: Section 11.3 


1. Compute the radii of convergence of the power series given below: 


1 
(a) a maT 
(b) +> ko e* xk 
k 
(c) Veo Die : 
2. Compute the radius of convergence of the power series )~ kot ae 


3. Use the approach of Example 11.32, together with the result of Example 9.48, 
to show that 


sinx = ) CY ge and cosx = ) es 1h oe 
(2k — 1)! 
k>0 k>0 


4. * Fork € N and |x| < 1, prove that 


1 ktn—-1)\, 
ed ( n )y. 
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5. Show that for |x| < 1 we have 
1 2k 
arcsin x = = aT er 
a0 MOK+ N\A 


6. The formula of Example 11.30 can be modified in order to compute log a for 
every a > 0. To this end, do the following items: 


(a) Show that, for x € (—1, 1), one has 


1 1+x ee x! 
= log 4 =X SP Sb Stee. 


2 1 3 5 7 


(b) Show that x bh oe defines a bijection from (—1, 1) onto (0, +00). 
(c) Use the formula of item (a) to compute log3 with four correct decimal 
places. 


7. * Let f : (-R,R) > R be given by f(x) = ois a,x*. The purpose of this 
problem is to show that, if }~ ko a,R* converges, then 


Tim f(x) = 0 ar". (11.27) 
x—>R- k=O 
To this end, do the following items: 


(a) Let r, = es a,R* and, for |x| < R, let y = z- Show that, for natural 
numbers n < m, 


m 


m 
= ax = (rny" - Im+1y") + > rey a yo), 


k=n k=n+1 


(b) Show that, for natural numbers n < m, 


m 


| Yo aux < ITn| + ITm| + (sup rel) 0" —y"). 
k>n 


k=n 


(c) Show that Deo a,x* converges uniformly on the interval [0, R]. Then, 
conclude that (11.27) is true. 


8. Use the result of the previous problem to show that 


pata ee 
(@) — -_- -- = see, 
e 7°3 A 
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9. * Use the material of this section to give a self-contained proof of the Leibniz 
formula for zr: 


Te oes Val Ls 
Ape 35 7 ; 


For the next two problems, we extend the notion of supremum by saying that 
a sequence (ax)x>o0 has supremum +00 if it is unbounded from above; in this 
case, we write 


sup{az; k > 0} = +00. 


Accordingly, we extend the notion of infimum by saying that a sequence (ax)x>0 
has infimum —oo if it is unbounded from below; in this case, we write 


inf{a,; k => 0} = —oo. 


10. Let (ax)¢>0 be a sequence of real numbers. Define its limit superior, denoted 
lim sup ax, by letting 


lim supa, = lim sup{aj, aj41,...}. 
j>+00 


(a) Prove that limsupa;, is a well defined concept and that, if s; = 
sup{aj,aj41,...} € RU {+00}, then limsupa, = inf{sj; 7 > O} € 
RU {+00}. 

(b) If lim sup a, € R, prove that it is the only real number M satisfying the two 
following conditions: 


i. Given € > 0, there exists infinitely many n € N such that a, > M—e. 
ii. Given € > 0, there exists at most finitely many n € N such that a, > 
Mr+e. 


11. Let es a,(x — xo)* be a power series with radius of convergence R. The 
purpose of this problem is to prove that 


1 


Ree, 
lim sup ¥/ |a,| 


where the right hand side is to be interpreted as being equal to 0 if 
lim sup </|az| = -+oo. To this end, assume without loss of generality that 
Xo = 0, let R be given as above and do the following items: 


(a) Fix 0 < r < Rand taker < s < R. Show that there exists ky € N such that 
lax| < + for every k > ko. Then, show that 


In}<r> lax] < 0 (<) < +00. 


k>ko k>ko 
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(b) Fix |x| > R and show that there exist infinitely many indices k € N such 
that |ayx*| > 1. Conclude that the series )>,.9 apx* diverges. 

(c) Conclude that R, defined as above, is indeed the convergence radius of the 
given series. 


12. As we saw in Example 11.40, the power series ei x” defines a differentiable 
function f : (—1, 1) > R. In this respect, prove that: 


(a) af'@) > Dae s* = 4, for0 <2 <1. 
(b) f(x) > —log(1 — x) —x, for0 <x < 1. 


11.4 Some Applications 


In this section, we briefly discuss the use of power series in Algebra and differential 
equations. For further applications of power series to Algebra and Combinatorics, 
we refer the reader to [5]. 

We begin with the following 


Definition 11.41 The (ordinary) generating function’ of a sequence of real 
numbers (a;,)n>0 is the power series 


> aux. (11.28) 


k>0 


The previous definition suggests that the main difference between the theory of 
power series and the method of generating function lies in a change of point of 
view. In the first case, we are primarily interested in examining the properties of the 
function f defined by the power series at the right hand side of (11.28); in the second 
(as we shall see next), we want to use the properties of f to infer conclusions about 
the terms of the sequence (a,)n>0. 

Let’s illustrate this by revisiting Problem 20, page 220 with the aid of generating 
functions. 


Example 11.42 (TT) The set of naturals is partitioned into m disjoint, infinite and 


nonconstant arithmetic progressions, of common ratios dj, dz, ..., dm. Prove that 
: + : af eeseet= : 1 
d\ dy dm 


Proof If f(x) = Vel x* and a; is the initial term of the i-th AP (that of common 
ratio d;), then the given condition, together with item (e) of Example 11.25, gives 


7In opposition to exponential generating functions, cf. Chap. 3 of [5], for instance. 
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for |x| < 1 that 


m Me xi 
f(x) = ie 4 iti bit 2di + see) — S a, 


i=] i= 
Since 1 — x4 = (1—x)(1+x+2° +---4 x41), multiplying both sides of the 
equality above by | — x we get 


m 


xi 
= a 11.29 

= ai pee reer ( ) 
for |x| < 1. Now, both sides of this last equality define functions continuous in [0, 1] 
and which coincide in [0, 1); hence, such functions coincide for x = 1 too, so that, 
letting x = 1 in (11.29), we obtain 72, 4 = 1. Oo 


The use of generating functions is particularly useful in the study of sequences 
(@n)n>o Satisfying a given linear recurrence relation. Here, as in Sect. 3.2, we shall 
treat the cases of (certain) linear recurrence relations of orders 2 and 3, postponing 
the analysis of the general case to Chap. 20 of [5]. 

The idea is to consider the generating function }°,.5 a,x* corresponding to the 
given sequence and, then, follow the various stages below, which comprise a sort 
receipt for several similar problems: 


I. Use the initial terms of the sequence, as well as the recurrence relation it 
satisfies, to conclude that the given generating function converges at some 
interval of the form (—r,7r). 

II. Again with the aid of the initial terms and the given recurrence relation, 
perform appropriate (generally algebraic) operations with the equality f(x) = 
Ys aex* to get a formula for f (x). 

III. Develop the formula obtained in item II in Taylor series. 

IV. Use the uniqueness of the power series representation of f, given by Corol- 

lary 11.29, to conclude that a, equals the coefficient of x” in the power series 


‘ A e(n) 
expansion of stage III (.e., a, = f 0), 


In order to go through stage I, the following result is frequently useful. 


Lemma 11.43 Let (dy),>0 be a sequence of real numbers. If there exist positive 
reals cand M such that \a,| < cM" for every n > 0, then the power series a, ayxk 


converges in the interval (-z. x): 


Proof Since |a,x"| = |dyl||x|" < cM"|x|" = c|Mx|" and the geometric series 
> ps0 |Mx\* converges when |x| < u (cf. item (e) of Example 11.25), the 
comparison test for series guarantees that )~>,.  a,.x* converges when |x| < i: Ll 


The following example implements stages I through IV above to get a positional 
formula for the n-th Fibonacci number. 
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Example 11.44 Let (F,,),>1 be the Fibonacci sequence, so that F; = 1, F, = 1 and 
Fy42 = Fy41 + F; for every integer k > 1. Compute F,, as a function of n. 


Solution Let f(x) = D031 7 .x* be the generating function corresponding to the 
Fibonacci sequence, and let’s run through the previously listed stages I to IV. 


Step I: note first that an easy induction gives F,, < 2” for every n > 1. 


Hence, Lemma 11.43 guarantees that the power series )°,., F; .x* converges in 


the interval (4, 5). 


Step IT: for x € (-3, 3), we can write 
f(x) = Fixt+ Fox + >> Fit 
k>3 


=xtx4+ Yo Fe + Fy) 


k>3 


=xt+x+ xy ea +x So Fea? 


k>3 k>3 
=x+x°4x(f(x) — Fix) +f (x) 


=x+(e+x)f(). 


Then, for x € (-5, 5) we have 


Xx 


i) > Fa, oe 


x— x? 


Step III: writing 1—x—x* = (1—ax)(1—x), witha, B € R, we haveat+B = 1, 
ap = —1 and 


Foe Gand me. 


Imposing, with no loss of generality, thata > 8, we geta = eae and B = 
lV so that a — B = 4/5 and, thus, 


. 
1-9 ta ren) 


Now, developing —_ and ck as geometric series, we obtain 


2 k ar S ak — pe 
£0) = = | Den! - Dew ->( = 2 


k>0 k>0 
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as long as |x| < min |}, a int = 5. Then, 


for every x € (4, 3). 
Step IV: finally, Corollary 11.29 assures that the power series expansion of f is 
unique, so that the last equality above, together with the initial definition of f, 
gives F,, = oF for everyn > 1. 

oO 


Proceeding similarly to the above example, we now use generating functions to 
give another proof of Theorem 3.16 (see also Problem 6). 


Theorem 11.45 Given u,v € R, with v 4 0, let (dn)n>1 be such that ayo = 
udy+1 + vax for every k => 1. Moreover, assume that the characteristic equation 


x? — ux —v = Ohas real roots a and Bp. 


(a) Ifa # B, then a, = Aa"' + BB"! for every n = 1, where A and B are the 
A + B = aj 

aA + BB = a2 ; 

(b) Ifa = B, then a, = (A + Bn)a""! for n = 1, where A and B are the solutions 
A + B= a\ 

(A + 2B)a =a) , 


solutions of the linear system 


of the linear system 


Proof We start by showing that there exists g > 0 such that |a,| < gq" for every 
n > 1. Indeed, assuming that |a,| < g* and |az41| < g‘*!, it follows from the 
triangle inequality that 


[azo] = [wanes + var| < |ullartil + |v|lael < lula“! + lola’, 


so that |az42| < q**? provided |u\q't! + |v|q* < q**? or, which is the same, 
|ulg+|v| < q’. Since the greatest root of the second degree equation x?—|u|x—|v| = 
0 is x9 = ; (i + fue + aul), we get |u|g + |v| < q? whenever g > xo. Hence, 
|an| < q" for every n > 1, as long as |a;| < gq, |a2| < g? and g > Xo, for which it 
suffices to choose g > max{|ay|, \/|a2|, xo}. 

Fix such a q, so that |a,| < g" for every n > 1. Lemma 11.43 guarantees that 


the generating function )~ k=l a,x* converges in the interval (-4, 1), thus defining 
fF: (-2, 1) > Rbyf@ = Vet ax". 


In order to get a simpler expression for f(x), we argue as in the previous example, 
using the recurrence relation satisfied by the sequence (a;)x>0 of the coefficients 


of f: 
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f(x) = ya =ayx+anx* + as 


k>1 k>3 


= ayx + ayx? + (way) + vaya)" 


k>3 
=ayx+ dx" + ux ) ap x! + vx ) pox? 
k>3 k>3 


= ayx + anx’ + ux(f (x) — ayx) + vx (x). 


Therefore, (1 — ux — vx*)f(x) = a,x + (a2 — ua;)x’ for every x € (-2. 1), 
Now, since a and f are the roots of x7 — ux — v = 0, we geta + B = wand 


aB = —v. Hence, 1 — ux — vx* = (1 — ax)(1 — Bx), so that 


_ axt (a2 — uay)x* 
f@M= ‘G—aol pe: (11.30) 


for |x| < min{2, a aE 


Let’s look separately at cases (a) and (b): 
(a) If a # B, we decompose the right hand side of (11.30) in partial fractions,® 
ie., we take A, B € R such that f(x) = — + am Obviously, such A and B 
A + B = aj, 
BA + @B = ua, — a 
system has a single solution, exactly because a # f. 
Expanding a and LE in geometric series, we get 


must satisfy , and one promptly notes that such a linear 


f(x) = Ax d(x) + Bx (Bx) 


k>0 k>0 

=) ok + BBY‘! 
k>0 

= oak + BBE), 
k>1 


Finally, comparing this last expression with the fact that f(x) = ye ayx*, we 
obtain a, = Aa‘! + BB! for every k > 1. 


(b) Ifa = B, then 2a = u,a? = —v and f(x) = aurhoo—asi)e This time we 


try to decompose the right hand side of (11.30) in partial fractions by writing 


8 general theorem on the existence of partial fraction decomposition for quotients of polynomials 
will be seen in [5]. For the time being, we shall content ourselves to describe what such a theorem 
says in the simple cases we consider here. 
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f(x) = “+ + —*.,,, for some A,B € R. Obviously, such A and B must 


l—ax (l—ax)? ” 
A + B = aj, 
aA = ua, — a 
forv 40>a £0. 


satisfy , and again this linear system has a unique solution, 


Expanding —_ and asd i power series (for the second fraction with the aid 
of Problem 4), we get 


f(x) = Ax )(ax)k + Bx 0 (k + Dax) 
k>0 k>0 
= xe + Blk + 1))orkxk*! 
k>0 


= 0A + Bhat!" 


k>1 


Finally, arguing as in the previous case we get a, = (A + Bk)a*! for every integer 
k>1. Oo 


Generating functions can also be used to deal with linear recurrence relations 
with nonconstant coefficients. To present such an example, we first need the 
following result. 


Proposition 11.46 [f f,g : (xo — R,xo + R) > R have power series expansions 
FX) = Vege o ae — xo)* and g(x) = Veo bie — xo)!, then the function fg : (xo — 
R,x9 + R) > R has power series expansion (fg)(x) = eer Cn(x — X09)", with 
Ch = arr aby = pa aDn—K. 


Proof Theorem 11.23 assures that the power series defining f and g converge 
absolutely in the interval (x9 — R, xo + R). Hence, by Theorem 7.53 and for every 
x € (X%) —R, x9 + R), we have 


FRB) = | D5 axle — 2x0)" | | So bre = x0)! 
k>0 I>0 
= 2G ( ss a(x — x9) +» B(x -») 
n>0 \k+l=n 


= a (x an) (= x)" 


n>0 +l=n 


= >» Cn(x — xo)". 


n>=0 
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Finally, note that Theorem 7.53 guarantees that the convergence of this last series is 
also absolute in the interval (xo — R, xo + R). oO 


Example 11.47 Let (an)n>o0 be the sequence such that a9 = 1, a; = —1 and 


Ak-1 
= ——_ + 2a, 
ak k + 2ap-2 


for every integer k > 2. Compute a, as a function of n. 


Solution Denoting by f(x) = oiso a,x* the generating function of the given 
sequence, let’s once again run through the previously described stages I to IV. 


Step I: once more we shall try to apply the comparison test for series, in the 
form of Lemma 11.43. Assuming that |a,_2| < a~?, |ag_1| < o*!, the triangle 
inequality gives 


k-1 
ar— a 
|ax| < ah + 2a,-2] $ — + 2a, 


; k= _ pas 
hence, we shall have |a,| < a provided S— + 2a‘~? < at or, which is the 


same, 


aie ee 
i <a’. 


Since such an inequality is true for @ = 2 and every k > 2, and |ao| < 2°, 
|a,| < 2!, it follows by induction that |a,| < 2" for every integer n > 1. Hence, 
the above mentioned lemma, together with the theory of power series, guarantees 


that f is infinitely differentiable in the whole interval (-4 5). 


272 
Step IT: writing the given recurrence relation as kag = —ag—1 + 2kay—2 fork = 2, 
we get 
f°) = Yo kay! = ay + YO kay! = ay + Y (ay + 2kaya)x 
kel k>2 k>2 
=ay— a + eae kapox”* 
k>2 k>2 


II 


a, — (f(x) — ao) + 2x Yk _ Q)ap—ax*? +2 = pax? 


k>2 k>2 


a, + ay —f (x) + 2x(f’(x) + 2f()). 


Taking into account that a, + ao = 0, we get f’(x) = (4x — 1)f(x) + 2xf"(x) or, 
which is the same, 


(2x — Df'(@) = —(4x— Df). 
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In order to solve such a differential equation, first note that f is positive in the 
interval (—r,r), for some 0 < r < 5 (for, f(0) = agp = 1 > O and f continuous 
imply f positive in some neighborhood of 0). Therefore, for |x| < r we can write 


f@) _  4x-1_ 1 


#Q) ~~ 2ea1 2x—1 


so that, for |x| <r < 5, 


loess) =togrto|_, = [OO 


% 1 
s=] (24——)a 
i ( +s) 


1 
= —2x-— 5 ost — 2x). 


dt 


Thus, for |x| <r < 5 we have 


f(x) =e 7 (1 — 2x)”. (11.31) 
Step IIT: note that the power series expansion of e~?* is given by (11.7), with 
a = —2, and holds in the whole real line: 
k 
—2x __ (—2) k 
aoe k! 
k>0 


Hence, it follows from Example 11.36 and Proposition 11.46 that, for f given as 


in (11.31) and |x] < r< 5, we have 


f(x) = 


a (—2)* 1 21 n 
->(¥ Sa?) x 


k+l=n 


| 
al 
Zhe 
— 
= 
L 
2|- 
ya 
ei 1 
NS 
= 
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Step IV: comparing the last expression above with f(x) = )~,,.9 dnx", it follows 
once more from the uniqueness of power series expansions that 


anv CI [2 


oO 


All of the above results can be seen as sorts of discrete versions of the following 
result on ordinary differential equations: 


Theorem 11.48 Letxp €¢ R, R > Oandp,q: (xo—R, x0 +R) — R be given by their 
Taylor series centered at xo. For a, B € R, there exist a unique twice differentiable 
function f : (x9 — R, x9 +R) > R such that 


f" + pf + af =0 11.32 
f(x0) = 0, fo) = B’ oe 


Moreover, f is actually infinitely differentiable and given by its Taylor series 
centered at xo. 


Interesting (and important) applications of the above result can be found, for 
instance, in Chap. 6 of [2], or in Chap. VI of [19]. We now discuss a particular case 
of it, referring to Problem 11.48 for a proof of the general case. See, also, Problem 4, 
page 510. 

Given real numbers a and b and a twice differentiable function f : R > R 
such that 


f’ + af +bf =90, (11.33) 
we Say that the second degree equation 
V+ak+b=0 (11.34) 


is the characteristic equation of (11.33). The discriminant A = a” — 4b of (11.34) 
is said to be the discriminant of (11.33). 


Theorem 11.49 Let a and b be given real numbers and f : R — R be a twice 
differentiable function satisfying (11.33). In the notations of the discussion above, 
there exist real constants A and B such that 


F(x) = Afi) + BQ) 


for every real x, where: 


(a) fi(x) = e7? and fy(x) = xe? ifA = 0. 
(b) fi(x) = e® and fy(x) = eF* if A > 0 and a and B are the roots of (11.34). 


ax 


(c) fi(x) = e772 cos (vee) and f(x) = e7 2 sin (2) ifA <0. 


2 


In particular, for given values of f (0) and f'(0), (11.33) has exactly one solution. 
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Proof First of all, writing f” = —af’—bf it is immediate that any solution of (11.33) 
is infinitely differentiable. 


Claim I f is given by its Taylor series in the whole real line. 
Fix M > 0 and choose C > 0 such that |f| < C and |f’| < C’ in [—M, M]. Since 
f&? + af) + bf = 0 for every k € Z+, we get 
pO] < fall FY] + IFO. 
Thus, if |f| < C/T! in [-M, M] for 0 <j <k +1, then 
[per < lace"! as |b|c* < Cet? 


provided C? > |a|C + |b]. Therefore, if we choose such a C from the beginning, we 
conclude that 


[f| < C’*! in [-M,M], V n> 0. 


Now, for x € [—M, M], Taylor’s formula gives 


n=l p(k Q (n) 
fo =F 7 de yt LO. (11.35) 
oi ! 


for some c € [—M, M]. However, since 


ME Beg 


(n) ctl 
TO a) 6 OF gr 
n!} n! n! 


letting n — +00 in (11.35) we conclude that f(x) = ipso LO yk in [—M, M]. 
Actually, since M > 0 was arbitrarily chosen, this holds for every x € R. 


Claim 2 for given values of f(0) and f’ (0), there is at most one solution for (11.33). 
(This claim will also establish the uniqueness part of the theorem.) 
For simplicity, write f(x) = )7,.9 #.x*. It follows from Theorem 11.27 that 


f= a ai =i al =) oe 


k>1 k>0 


and, analogously, f”(x) = )°,.) “4¢*x*. Substituting these expressions into f” + 
af’ + bf = 0, we find 


xt 
Cae + adage, + bax), = 0, 
k>0 . 
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and the uniqueness of power series expansions gives ay42 + daz41 + ba, = 0 for 
every k > 0. However, since aj = f(0) and a, = f’(0), the sequence (a,)x,>0, and 
hence the function /, is completely determined. 


Finally, to finish the proof it suffices to check, in each of the cases (a), (b) and 
(c), that: (i) the given functions f, and f satisfy (11.33); (ii) for any given values for 
f(O) and f’(0), one is able to find real values for A and B such that 


Af (0) + Bfr(0) = f(0) 
Afi(0) + BAO) = f’(0) 


Once this has been done, the uniqueness of solution, as established in Claim 2, 
allows us to conclude that f = Af, + Bf. 

Checking items (i) and (ii) in each of (a), (b) and (c) is actually quite simple. For 
item (c), for instance, item (ii) amounts to showing that the system of equations 


a V—-A 
A=f(0), -~A+——B=f'(0) 
2 2 
always has a solution, which is immediate. As for (i), one just needs to compute f/ 
and f’ fori = 1, 2, then showing that f’ + af/ +bf; = 0. We leave this as an exercise 
to the reader. Oo 


Remark 11.50 Note that the prescription of the values of f(0) and f’(0), instead of 
f(a) and f’ (xo) for some xo € R, is irrelevant. Indeed, letting g(x) = f(x + x0), we 
have g(0) = f(x), g’(0) = f’(%o) and g” + ag’ + bg = 0, so that we can apply the 
previous result to find g and then find f(x) = g(x — x0). 


Example 11.51 Find all differentiable functions f, g : R — R such that f(0) = 0, 
g0) = 1, f’ =f + gand g’ = 2f. 


Solution Since f’ = f + g, which is a differentiable function, we conclude that f is 
twice differentiable. Then f” = f’ + g’ = f’ + 2f or, which is the same, 


f =f of =0. 


Now, the second degree equation A” — A — 2 = 0 has real roots 2 and —1, so that 
the previous theorem gives f(x) = Ae** + Be~ for every real x. Since f(0) = 0 and 
f'(0) = f(0) + g(0) = 1, we getA + B = f(0) = 0 and 2A — B = f’(0) = 1. 

Solving such a system of equations, it follows that A = ; and B = —i, so that 
f@) = i (e* —e~™) and g(x) = f’(x) —f(@) = 3 (ec + 2e~*). o 
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Problems: Section 11.4 


1. Compute the number of nonnegative integer solutions of the equation 
a, +42 +++ +a =m, 


where & and m are given natural numbers. 

2. Compute the number of nonnegative integer solutions of the equation a; + a2 + 
a3 + a4 = 20, such that a; > 2 and a3 < 7. 

3. Generalize the result of Proposition 11.46, showing that if f : (4o—R, x9 +R) > 
R is given by f(x) = )o,s9 4n(x — x0)", then for k € N we have f(x)* = 
> 0 Cn(X — Xo)", where ~ 


Ch = ) Qj, jn... Giz 


and the above sum extends over all k-tuples (ii, ..., ix) of nonnegative integers 
satisfying i) +--- +i =n. 
4. In the notations of the discussion of Example 11.42, prove that 


5. Use generating functions to find a, as a function of n, where (a,),>1 is given 
by a, = 2anday4, = a, + (K+ 1) fork > 1. 

6. Use the methods of this section to give another proof to Theorem 3.19. 

7. The sequence (a,)n>0 is given by dj = 1 and 4,4; = 2a, + nforn > 0. In 
order to compute a,, as a function of n, do the following items: 


(a) Ifa, < a”, show that a,4, < a”t! as long as a” (a@—2) > n; then, conclude 
that a, < 3” for every n = 0. 


(b) Show that the generating function of (a,),>9 converges in the interval 
1 eee: “oy 2 
(—. +) and is given by f(x) = Toon: 


(c) Find real constants A, B and C such that 


1-2x+2 A a B i re 
(=.=) (l= " 12" Toe 


(d) Expand each of the functions of the right hand side above in power series 
to conclude that a, = 2"! — (n + 1) forn > 0. 


8. Let (an)n>o be given by dp = 1, a) = —3, az = 5 and agy3 = ayer + Sak+1 _ 
Fak for every integer k > 0. The purpose of this problem is to show that (ay) n>0 
converges and to compute the corresponding limit. To this end, do the following 
items: 
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(a) Show that |a,| < 5” for every integer n > 0. Then, conclude that the radius 
of convergence of Ss ayx* is at least k. 

(b) Iff: (-4, +) — Ris given by f(x) = Dips a,x‘, use the given recurrence 

15x?—8x+2 

(x=) @?—=2)* 


(c) Writing f(x) = “A + oar; + ai show that A = —9. Then, expand — 


1 1 : 
j= in 5 power series to get 


relation to get f(x) = 


|—-— 
B+ a} 
x)= = —— 
F(x) 2, ( (/2)k+! 


(d) Conclude that a, = —9 — ae for n > 0, and hence that a, a), 


(fay 


. (Putnam) Let u,v, w: R > R be given by the power series expansions 


3 6 9 


Xx Xx Xx 
x x! x10 
UO) Sa ign 
x2 x x8 xl 
WO Saat a Pan 


Prove that, for every real x, one has 


u(x)? + v(x)? + wx)? = 3u(x)v(x)w(a) + 1. 


. The purpose of this problem is to give a proof of Theorem 11.48. To this end, 


let 


P(x) = > by(x — x0) and q(x) = y cx (x — x0)* 


k=0 k>0 

in the interval (xp — R, x9 + R) and do the following items: 

(a) If f@) = yes a,(x — xo)* in the interval (x9 — R, xo + R) and f satisfies 
f" + pf’ + 4f = 0, show that 


k 


(k + 2)(k+ Vagro + Yo (G+ Ddi-jajr1 + cxjaj) = 0 
j=0 


for every k > 0. Then, conclude that az, a3, a4, ...are uniquely determined 
by ao = f (x0), a1 =f’ (x0), p and gq. 
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(b) Let ag = a, ay = B and0 <r < Rbe given. If |x—xo| < r,A = 2 |x — xol 
and Ay = |ax(x — xo)*| for k > 0, show that 


k 
(k+ 2)(k+ WAng2 < DOG + Vibe lhTj41 
j=0 


k 
+O leg IIH A; 


j=0 


(c) Let B,C > 0 such that |b;|r* < B and |c;,|r* < C for every k > 0. If 
M = max{Br, C}, show that 


k+1 
(k+ 2)(k + Argo <M G+ Data, 


j=0 
(d) Let (Ax)x>0 be defined by Ay = Ao, Ay = A; and 


k+1 
(k + 2)(k+ DAgpo = MOG + DAMTTIA;. 


J=0 


Show that Ax < Ax for every k > 0. 
(e) If Ao 4 0 or Ay ¥ 0, show that Ay, > 0 for every k > 2. Then, show that 


Agta  AR(K+ 1) +M(k+2) & 
—— Sh 1. 
Akt (k + 2)(k+ 1) 


(f) Conclude that >> hae converges, and that this implies the convergence of 
> ¢s0Ak- Then, note that this is the same as saying that )>,. 9 ax(x — x0)* 
is absolutely convergent in (x9 — r, Xo +r). Finally, show that f, given as in 
(a), is well defined and does satisfy (11.32). 
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In this section, we show that a function defined by a power series is analytic, i.e., is 
given by a convergent power series around each point of its interval of convergence. 
We also study the set of zeros of an analytic function and use the results we get to 
give purely analytic derivations of the properties of the sine and cosine functions. 
Our purpose in doing so is to show the reader that all of the arguments we 
have done so far concerning these functions are actually independent of any 


500 11 Series of Functions 


considerations on the geometry of circles. As we anticipated in Remark 9.12 and 
now reinforce in a slightly different way, this is not a simple pedantism, for: 


(i) the proof we have given for the formula sin’x = cosx relied upon the 
fundamental trigonometric limit (cf. Lemma 9.11); 
(ii) in turn, the proof of that result made use of the formula for the area of a circular 
sector; 
(iii) then, in Example 10.43 we computed the area of a circle of radius R by using 
the change of variables formula to reduce it to a simple application of the FTC, 
which relies upon the fact that sin’ x = cos x. 


Hence, if we are not able to free the properties of the sine and cosine functions 
from the geometry of the circle, we will be forced to conclude that all of the 
arguments related to items (i), (11) and (iii) above—and, therefore, to a large part 
of this book—are totally fallacious. 

We begin our presentation with the following 


Definition 11.52 Let J C R be an open interval. A function f : ] > R is analytic 
if, for every xo € J, there exists R > 0 and a power series Yeeeo ayx* such that 
(xo — R, xo + R) C Jand f(x) = Yeeeo ayx* for every x € (x) —R, x9 + R). 


As we pointed out in the beginning of this section, a function defined by a power 
series in an open interval (x) — R, x9 + R) (possibly with R = +00) is analytic. In 
order to prove this, we need the following result on double series which is interesting 
in itself. 


Proposition 11.53 For each j,k € Z4, let ay € R be given, such that ) 430 \ajxl 
converges for each j = 0 and Yi>0 Deeeo |aj.| also converges. Then: 


(a) Yeo aj converges for each j = 0 and Yeo ajx converges for each k > 0. 
(b) Leo eo Ge and Driso Yiro aj. converge and 


>a = Yo aw. (11.36) 


j20 k>0 k>0 j>0 


Proof First of all, given m,n € N, our hypothesis gives 


n m m 


n m 
dod lawl = DoD lawl < DD lawl SDD lawl < +00. 
k=0 j=0 j=0 k=0 j=0 k>0 j=0 k>0 
Hence, letting m — +00 we get 


YS lanl < D5 YE lanl < +00, 


k=0 j>0 j20 k>0 
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for every n € N. Therefore, et |aj| < -++oo for every k > 0 and, letting n > +00 
in the above inequality, we get 


dod lawl = DL laiel < +00. 


k>0 j>0 j20 k>0 


Thus, > o ps0 i> |x| also converges. 
Now we can prove (a) and (b), except for (11.36): since 0 |aj.| converges for 


every k > 0, itis also true that SZ 


jo “jk converges for every k > 0, with bas ax| < 


Yeo |ajx|. Summing this inequality over k > 0, we get 


>| Xan < OYE la| < +00. 


k>0 j>0 k>0 j>=0 


Therefore, ees ys j>0 Gk iS absolutely convergent, hence convergent. Arguing in 
an entirely analogous way, we conclude that both of the series }°,.,) aj and 
Viso Djso Ge are (absolutely) convergent. 


In order to prove (11.36), let A = | Deo Deo Gk — Dero Deo ax; Then, for 
m,n € N, we have 


m 


oS het Dhara ral 
j=0 k>0 j>m k>0 k=0 j>0 k>n j=0 
= LDH LE a|+|C Dal +| ode 
j=0 k>0 > j>m k>0 k>n j>0 


Let € > 0 be given. Since i. Soxso la] and Dops0 Dojs0 lajx| converge, we 
can choose mg, mo € N such that 


m>m => >> >> |axl <« and n> => >) > lajl <e. 


j>m k>0 k>n j>0 


In turn, this gives | > em Ves ax < € and | bare ae, ait < €, so that, with 
m => mo and n > no, we get in the above estimate for A 
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A < 2 ae - OY an| + 2 = ae - DO Val + 2 


j=0 k>0 k=0 j>0 k>0 j=0 j>0 k=0 
s|yYo-Leal+|LLal+ Een +2 
k=0 j=0 = 
m n 
< Soe lait! +235 lapel +2€< Soe aie! + 29° lait! + 26 < 4e. 
k>n j=0 j>m k=0 k>n j=0 j>m k>0 


We are now in position to prove the following 
Theorem 11.54 If f : (xo — R,x0 + R) — R is given by the power series f(x) = 
so % (x—xo)*, then f is analytic. More precisely, for a given yo € (xy —R, Xo +R), 
there exist bo, bj, b2,... € R such that f(x) = a, by (x — yo)* in (vo —r,yo +17), 
where r = R— |yo — xo| > 0. 


Proof First of all, note that (vo — r,yo + r) C (% — R, xo + R). Now, for x € 
(vo —r. yo +r), we can write 


fe) = Yo ay(x — x0)" = Yo ax ((x— yo) + 0 — x)" 


k>0 k>0 
= Dad (joo 
k>0 j=0 


If we can change the order of the sums in the last expression above, we will get 
£9 = Z (Lal) oon) 
j20 k> 


Letting bj = Yop; a (‘) (vo — x0)*7, we will have 


f@) =) b@—yoy, 


Jz20 


thus finishing the proof. 
Therefore, we are left to justifying the equality 


»~ ak es ( ox — xo)* Tayo = os OS ak () (yo — x0)'7) (x — yo). 


k>0 j=0 j20 kaj 
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According to the last proposition, this will be true provided 


Erli( |b —-o x — yo < +00. 


k>0 j=0 


However, this expression equals 


Y- |ael (Ivo — x0] + [x — yol)' 


k>0 


which is finite, for |yp —xo| + |x—yo] < lvo—xo| +r = Rand )%j59 axuk converges 
absolutely for u € (—R, R). Oo 


We show next that the set of zeros of an analytic function f : J — R do not 
accumulate on I. To this end, we first need a formal definition of this concept. 


Definition 11.55 Given X C R anda € R, we say that a is an accumulation point 
of X if there exists a sequence (a,,)n>1 of pairwise distinct elements of X such that 
yn —> a. 

Theorem 11.56 Let I C R be an open interval, f : I > R be an analytic function 
and Z = {x € 1; f(x) = 0}. The following conditions are equivalent: 


(a) Z has no accumulation point in I. 
(b) There exists xo € I such that f (xo) = 0 for every integer k > 0. 
(c) f vanishes identically. 


Proof (a) => (b): let x9 € 7 be an accumulation point of Z, say x9 = limp—++o0 Xn, 
where (X,)n>1 is a sequence of pairwise distinct elements of Z. Since f(x,) = 0 for 
every n > 1, the continuity of f gives f(xo) = 0. 

By contradiction, assume that there exists an integer kj > 1 such that 
f© (xo) 4 0. Without loss of generality, we can assume that ko is minimum 
with such a property. By the analyticity of f, we can choose R > 0 such that 
(xo — R, xo + R) C J and, in such an interval, 


FQ) = oS 0) 6. — yt = (2 nye DES 2) — aay®, 


k>ko k>ko 


If g@)= yi o {2% (x — xo)‘, then the series which defines g converges in 


(xo — R, xo + R) \ {xo} for g(x) = ei in such an interval). Since it obviously 
converges at x9, we conclude that g is continuous in (xo — R, x9 + R), with g(x) = 


Eco) # 0. By the sign preserving lemma for continuous functions, there exists 
0 < 5 < Rsuch that g 0 on (% — 6, x + 8). Since f(x) = g(x)(x — xo) in this 
last interval, we conclude that f ~ 0 in (xo — 6, xo + 5) \ {xo}. But this contradicts 
the fact that x, € (xo — 6, xo + 5) \ {xo} for every sufficiently large n. 
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(b) > (c): let x9 € I be such that f(x) = 0 for every integer k > 0, and let 
R be the radius of convergence of the power series defining f around xo. Then, for 
x € (x9 — R, xo + R) we have 


® (x 
i= G50, 


! 
k=ko k! 


i.e., f vanishes identically in (xo — R, xo + R). 
Let’s use this fact to show f vanishes identically in JM [xo, +00). By contradic- 
tion, assume that there would exist a € IM [xo, +00) such that f(a) 4 0. If 


A = {x €1N [xo, +00); f(x) # 0}, 


then A # @ (since a € A) and xp + R is a lower bound for A; therefore, A has an 
infimum xo, which obviously satisfies x9 + R < x) < a. Hence, xj € J and, since 
XH is the greatest lower bound of A, we conclude that [x0. Xp) MA = @. This is the 
same as saying that f vanishes identically on [xo,.xp), hence on [xo, xp]. In turn, this 
gives that f (xp) = 0 for every integer k > 0. Now, an argument analogous to the 
one in the previous paragraph guarantees the existence of R’ > 0 such that f(x) = 0 
for every x € (xj — R’,x) + R’). Therefore, xo could not be the infimum of A, which 
is an absurd. 

Finally, by the same token one can prove that f vanishes identically in 7 
(—oo, Xo], so that f vanishes identically in J. 
(c) = (a): obvious! oO 


We now collect two straightforward consequences of the previous result which 
will be useful for what comes next. 


Corollary 11.57 Let I C R be an open interval and f,g : I > R be analytic 
functions. If f° (xo) = g” (xo) for some xo € I and every integer j = 0, then f = g. 


Proof Letting h = f — g, we get h (xo) = 0 for every integer j > 0. Since h is also 
analytic on J, the previous result guarantees that h vanishes identically on J, and this 
is the same as having f = g. oO 


Corollary 11.58 Let f : (—R,R) — R be an analytic function, possibly with R = 
+oo. If f vanishes somewhere in (0, R) but doesn’t vanish identically, then f has a 
smallest positive zero. More precisely, there exists a € (0, R) such that f(a) = 0 but 
f doesn’t change sign on (0, a). 


Proof Otherwise, there would exist a sequence (x,),>1 of zeros of f in (0, R), with 
xX, > xX. > +++. If xo = limy++oo Xp, then x9 would be an accumulation point of 
the set of zeros of f in [0, R), hence in its interval of convergence. By the previous 
result, f would vanish identically, which is a contradiction. Oo 


We are now in position to undertake a purely analytical development of 
Trigonometry. To this end, first notice that if sin and cos are to satisfy sin’ = cos 


11.5 A Glimpse on Analytic Functions 505 


and cos’ = — sin, then they are infinitely differentiable and ultimately must be given 
by their Taylor series centered at 0, as in Example 11.4. 

The key to free Trigonometry from Euclidean Geometry is to turn this around, 
defining sin and cos by means of those power series. Therefore, we start by letting 


f@a=>> gr and g(x) = D> CY (11.37) 


for every x € R. 
The analytic foundations of Trigonometry are the content of the four coming 
results. 


Proposition 11.59 With notations as above, we have that: 


(a) f and g are well defined and analytical, with f odd and g even. 

(b) fo) =f, FO» = gf _ —f, fr) =-g and g4) = g, ght =, -f, 
gtk+?) — _g gAk+3) — F, for every integer k > 0. 

(c) f and g are nonconstant and such that f(x)? + g(x)* = 1 for every x € R. 

(d) f(x +y) = f@)sy) +fO)g@) and f(x — y) = f@)80) —f0)8() for every 
x,y € R. In particular, f (2x) = 2f(x)g(x) for every x € R. 


Proof (a) The convergence of Yea ca together with the comparison test for 
series, assures the absolute convergence, in the whole real line, of both power 
series in (11.37). Then, Theorem 11.54 shows that they are analytical functions. 


Now, the oddness of f and the evenness of g follow from their very definitions. 


(b) Theorem 11.27 gives 
(19 ye CD 
/ = 2j-2 _ 2i = y 
PO= 2 ga” = Le ay” = 80 
j=l i=0 
analogously, g’ = —f. Now, f” = g’ = —f, g” = (-f)’ = —g and the given 
relations easily follow by induction. 
(c) It follows from (b) that (f? +27)! = 2ff’ +2ge’ = 2f¢+22(—f) = 0. Therefore, 


f* + g” is constant and, since f(0) = 0 and g(0) = 1, we get f(x)? + g(x)? = 1 
for every x € R. Also, f’(0) = g(0) = | and g’(0) = —g(0) = —1 assure that 
f and g are nonconstant. 

(d) Fora fixed y € R, let h(x) = f(x + y) and [(x) = f(x)g(y) + f) g(a). By the 
chain rule, h,/: R — R are infinitely differentiable. Moreover, by item (b) and 
the chain rule again, these functions satisfy 


h/+h=0onR ae l”“+1=O0o0nR 
h(0) = f(y), AO) =f’) 1(0) = f(y), /() = 80) 


Since f’(y) = g(y), the uniqueness part of the proof of Theorem 11.49 (which 
did not depend on any properties of sin and cos) guarantees that h = 1. 
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Now, the first formula, together with the fact that f is odd and g is even, gives 


Fx —y) =f@sCy) + f-y)s@) = fs) —fO)8@). 


Finally, the expression for f(2x) also follows from the first formula, once we let 
y=x. oO 


Proposition 11.60 The function f has a smallest positive zero. 


Proof Start by observing that, with item (b) of the above proposition at our disposal, 
the results of Problem 4, page 325 remain true for f and g in place of sin and cos, 
respectively, so that 


x x 
X— 3 S/O) Sx and a@) 21-5 
for every x > 0. In turn, this gives 


3 5 2 4 


ee oa and ej2i= pe 


3! = 5! 2! A! 
for every x > 0. For instance, for x > 0 one has 


3 


d 2. 4 
—(s@)-14+ 5-5) =F t+x-F <0, 


2! 4! 
so that g(x) 1+ 4 —% <g(0)-1=0. 
Now, the maximum of x — © for x > 0 is attained at x = ./2 (check it!). 


3 
Therefore, it follows from the above that 


jf 2/9 
Ff (v2) = V2- = —. 
3! 3 
On the other hand, the minimum of 1 — x + x for x > 0 is attained atx = V10 


(check it too!). Since f(x) < x(1 a - + =) for x => 0, this gives 


V1. a/10 > : 4/10 
f(V10) < VT0(1 = ane T) =~. 
Since a2 > 00. the IVT (applied to the intervals [0, /2] and [/2, /10] 


assures the existence of 0 < a < /2 < B < ¥/10 such that f(a) = f(B) = afi, 
Then, the MVT of Lagrange gives y € (a, 8) such that f’(y) = 0. 
It follows from the last part of item (d) of the previous proposition that 


fr) = FY)8V) = FOF) = 9, 
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so that f has a positive root. Since f doesn’t vanish identically, Corollary 11.58 
shows that it has a smallest positive zero, as wished. Oo 


From now on, we let z denote the smallest positive zero of f. The last part of 
item (d) of Proposition 11.59 gives 


o= na) =(5)s(5) 


and since f is positive on (0,2) this shows that g(4) = 0. Item (c) of Proposi- 
tion 11.59 allows us to write 


0<f(=)=+ 1 (=) =41 
ae > an 
so that f(4) = 1. Then, item (d) of Proposition 11.59 gives 


t(F x) =($)8@ -F@s(F) = 8@. (11.38) 


We are finally in position to go one step further. 
Proposition 11.61 With notations as above, we have that: 
(a) g: [0,2] > [-1, 1] is a decreasing bijection with continuous inverse. 
(b) g!: (-1,1) > (0, x) is differentiable. 
(c) Im(f) = Im(g) = [—1, 1]. 
(d) f(x + 1) = —f(x) and g(x + 1) = —g(a). 
(e) f and g are periodic, with period 21. 


Proof (a) Since g’ = —f < 0 in (0,7), we conclude that g is decreasing in [0, z]. 
Since 


1 = g(0) > g(r) = V1 —-f(@y = $1, 


we conclude that g(a) = —1. The rest follows from Theorem 8.35. 

(b) This follows directly from Theorem 9.28. Indeed, since f 4 0 in (0, 7), we get 
for x € (0, 7) that g’(x) = —f(x) £ 0. 

(c) Item (c) of Proposition 11.59 gives Im(g) C [—1, 1]. Then, item (a) above gives 
Im(g) = [—1, 1], from where (11.38) gives Im(f) = [-1, 1]. 

(d) An application of item (d) of Proposition 11.59, together with g(z) = —1, gives 


S(x+ 1) = f@)g(a) + f(m) 8) = FQ). 


Therefore, 
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and, hence, 


In turn, (11.38) gives 


g(x+ 7) =f(-x-3) =-f(x+ 5) 


= -fe(>) -F(F)e@) = -8@). 
(e) Firstly, two applications of (d) give 
fet 2x) =f((e+ 2) +2) =f +7) =f) 


and, analogously, g(x + 27) = g(x). 

Now, let p > O be such that f+ p) = f(x) for every x € R. Then 
f(p) =f) = 0, so that p > a (for, m is the smallest positive zero of f). Since 
f(x+z) = —f(x) andf is not identically zero, we also have p 4 2. If m < p < 27, 
then 0 < p—z <a and 


f(p — «) = f(P)g(a) — f(a) gp) = 9, 


so that p— a > zx. Therefore, p > 27, and 27 is the smallest p > 0 such that 
f(x+p) =f) for every x € R. 

Finally, we leave to the reader the (analogous) task of checking that 27 is the 
smallest p > 0 such that g(x + p) = g(x) for every x € R. Oo 


Our final result shows that f and g parametrize the unit circle of the cartesian 
plane. 


Proposition 11.62 For every point (x,y) in the unit circle x? + y? = 1 of the 
cartesian plane, there exists a unique @ € [0, 277) such that x = g(0) and y = f(@). 
Moreover, if yo = 0, then @ can be chosen in the interval [0, 1]. Also, as 6 varies 
from 0 to 2x, the point (g(@),f(@)) turns around the circle exactly once. 


Proof For the existence part, let (xo, yo) belong to the unit circle, so thatx,+y = 1. 
Since g is periodic of period 27 and Im(g) = [—1, 1], there exists a € [0, 277) such 
that x» = g(a). Therefore, 


Yo = 1-4» = 1- g(a) = f(a)’, 
so that yo = +f(q@). If yo = f(a), let 0 = a. If yo = —f(a), then the evenness g, 


the oddness of f and their periodicity give 


Xo = g(@) = g(—@) = g(2m — a) 
yo = —f(@) = f(-a) = fx —a)’ 


and it suffices to let 0 = 27 — a. 
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For the uniqueness part, assume, for the sake of contradiction, that there exist 
0 Ss 0; < 0, < Qn such that (x0, Yo) => (g(@1),f(01)) => (g(62),f(02)). Then, 


F (G2 — 61) = F(92)8(A1) — F(A1)8(A2) = 0, 


with 0 < 02 — 6; < 2z. Since z is the smallest positive zero of f and the next one 
is 27, this gives 02 — 6, = 2. However, if this is so, then 


Xo = g(62) = g(91 + ) = —g8 (01) = —Xx0 
yo =f (82) = f (0 + x) = —f (61) = —yo ’ 


so that x» = yo = O. This is a contradiction to the fact that (xo, yo) belongs to the 
unit circle. 
We leave the rest as an (easy) exercise to the reader. oO 


At this point, the reader might perhaps want to look again at Example 10.80 to 
realize that, in view of the four propositions above, that result gives a genuine proof 
of the fact that the unit circle has length 277. 

On the one hand, this validates the whole construction of the trigonometric 
functions, as usually done in elementary school Trigonometry. On the other, Leibniz 
formula for z (cf. Problem 9, page 484) provides a purely analytical way of getting 
numerical approximations for 2, which can be used to get the usual estimate 
mw & 3.14159, with five correct decimal places. 


Problems: Section 11.5 


1. Prove that, for every integer k > 1, 


and 


2k-1 


(<1) IY oy 
dX arr rs DL, Gpre 


2. One can prove that if J, J C R are open intervals and g : ] > J andf : J > Rare 
analytic, then so is f o g: J > R. Use this fact to show that log : (0, +00) > R 
is analytic and to find its Taylor series expansion around xo > 0. 
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3. Prove the following theorem of S. Bernstein’: let J C R be an open interval, 
f : 1 > R be infinitely differentiable and such that f(x) > 0 for every x € I 
and every sufficiently large natural n. Then, f is analytic. 

4. Let 7 C R be an open interval and p,q : I — R be analytic functions. The 
purpose of this problem is to show that, for aw, 6 € R and xp € J, there exists a 
unique analytic function f : J > R such that 


fl + pf tof =0 7 
fi) af =P" en 


To this end, do the following items: 


(a) Show that (11.39) has at most one solution. 

(b) Show that there exist some open interval J C J, containing xo, and an analytic 
function fy : J > R satisfying (11.39). 

If J is as in (b) and J # J, show that there exist an interval J’ C J, properly 
containing J, and an analytic function fy : J’ > R satisfying (11.39) and 
such that fy = f; along J. 

Finish the proof of the problem. 


(c 


wm 


(d 


wm 


° After S. N. Bernstein, Russian mathematician of the XX century. 
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Appendix A 
Glossary 


Problems tagged with a country’s name refer to any round of the corresponding 
national mathematical olympiad. For example a problem tagged “Brazil” means that 
it appeared in some round of some edition of the Brazilian Mathematical Olympiad. 
Problems proposed in other mathematical competitions, or which appeared in 
mathematical journals, are tagged with a specific set of initials, as listed below: 


. APMO: Asian-Pacific Mathematical Olympiad. 

. Austrian-Polish: Austrian-Polish Mathematical Olympiad. 
. BMO: Balkan Mathematical Olympiad. 

. Berkeley: Berkeley Preliminary Examination. 

. Baltic Way: Baltic Way Mathematical Contest. 


Crux: Crux Mathematicorum, a mathematical journal of the Canadian Mathe- 
matical Society. 


7. EKMC: Eotvés-Ktirschak Mathematics Competition (Hungary). 
8. IMO: International Mathematical Olympiad. 
9. IMO shortlist: problem proposed to the IMO, though not used. 
10. Israel-Hungary: Binational Mathematical Competition Israel-Hungary. 
11. Miklés-Schweitzer: The Miklés-Schweitzer Mathematics Competition 
(Hungary). 
12. NMC: Nordic Mathematical Contest. 
13. OCM: State of Ceara Mathematical Olympiad. 
14. OCS: South Cone Mathematical Olympiad. 
15. OBMU: Brazilian Mathematical Olympiad for University Students. 
16. OIM: Iberoamerican Mathematical Olympiad. 
17. OIM shortlist: problem proposed to the OIM, though not used. 
18. OIMU: Iberoamerican Mathematical Olympiad for University Students. 
19. Putnam: The William Lowell Mathematics Competition. 
20. TT: The Tournament of the Towns. 
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Appendix B 
Hints and Solutions 


Section 1.1 


1. Write ¢ = 5 = r, thus obtaining a = br, c = dr and, then,at+c=r(b+ 4d). 

2. Let x = 0.a;a2a3... and suppose that the sequence (a), d2,a3,...) iS periodic 
from some point on, as in (1.3), say. If y € N is the integer with decimal 
representation b;b2 ... by, conclude that 10'tPx = y+ 10!x, so that x = Ro 
a rational number. Conversely, let x = a witha,b € Nand0 <a<b.Ify,eN 
is the integer with decimal representation ajd2...a,z, use the division algorithm 
to conclude that 10a = by, + rz, with 0 < 7, < b. Then, use the fact that there 
is only a finite number of possibilities for 7, to assure the existence of natural 
numbers / and p such that rj, = r;. From this point on, conclude that the list 
(a, 42, 43,...) is as in (1.3), with by = aj41, bo = aj42,..., bp = Atty. 

3. Adapt, to the present case, the proof of the uniqueness of additive inverses, 
changing + by- and 0 by 1. 

4. Start by observing that0 = 0-a = (1+ (-l))a=1-a+ (-Da=a4 (-la; 
then, use the uniqueness of additive inverses. 

5. Among the given decimal representations, the only one that does not correspond 
to a rational number is that of item (d); in order to prove this claim, observe 
that, in the given decimal representation, there will be arbitrarily long sequences 
of zeros. In order to write the numbers of items (a), (b) and (c) as irreducible 
fractions, follow the steps suggested in the hint to Problem 2. 
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516 B_ Hints and Solutions 
Section 1.2 


1. For items (b) and (c), use (7’) and (a); item (g) follows from items (b) and (c). 
2. with equality if and only if a = b = c = 0. Adapt the proof of Corollary 1.5. 
3. For item (a), observe that (rs)” = (rs)--- (rs) = (for) -(s-++s) = rs". For 
the other items, apply analogous arguments. 
4. Consider separately the cases m < n,m = n and m > n. In each case, make 
judicious use of the property of item (b) of the last problem. 
5. Ifb = 2*.5!, with k,l € Z;, andn = max{k,/}, then $= 225" 
6. If n = 2k, with k € N, write x’ = (x*)* and, then, apply item (g) of 
Proposition 1.2. The case of an odd n can be treated similarly. 
7. Start by showing that 5 - i + ; - z > z. 
. Compute the difference ot - i 
9. Letting S denote the sum of the ten given numbers, show that 4S can be written 
as the sum of ten real numbers, each of which equals the sum of four of the ten 
given numbers. 
10. Argue as in Example 1.4, using the fact that 31 < 32 = 2° and 17 > 16 = 24. 
11. For items (a) and (b), begin by observing that a” < b” if and only if (¢)" <1; 
then, apply the result of Corollary 1.3. For item (c), begin by noticing that 
a" +b" < (a+ bd)" if and only if (-4)" + (4,)" < 1; then, apply the result 
of Corollary 1.3. 
12. Use the fact that a* < c-a? and b> < c-b* and, then, apply Pythagoras’ theorem. 
4 9 


13. The given inequality is equivalent to (4)" + (4)" + (3)" > 2. This last one 


holds trivially for n = 1 and n = 2; for n > 3, apply item (b) of Corollary 1.3 
to conclude that (3)" > ey > 2. 

14. Start by observing that, ifa > 4, then 0 < + + t + + <i4 ; + ; < 1, so that 
4 + i + 1 cannot be an integer. 

15. Ifa > 5 is an integer, show that 4(a — 4) > a; from this, conclude that it is not 
worth to have summands greater that 5. Then, by performing similar changes 
of summands, discard all of those which are equal to 4 or 5. Finally, show that 
is more advantageous to have more summands equal to 3 than summands equal 
to 2. 

16. If we let a be the common leftmost digit, show that there exist nonnegative 
integers k and / such that a-10* < 2” < (a+1)-10* anda-10! < 5” < (a+1)-10'; 
then, multiply these inequalities. 

17. Suppose that all three inequalities are true; the second one can be written as 
(a — d)(c — b) < 0, so that the first one gives a— d < 0 < c—b. Now, use the 
third inequality to get ad(c— b) < bc(a—d) and, then, arrive at a contradiction. 


oo 
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Section 1.3 


2. For item (a), for instance, show that (2/xy)" = xy and (./x./y)" = xy; then, use 
the definition of the n-th root. 

3. By contraposition,' show that, if b 4 0, then r is rational. 

4. Reduce this problem to the previous one. 

5. Argue by contraposition. 

6. If ab # 0, compute (a + b/2)? = (—cV/3)? to conclude that /2 should be 
rational, which is an absurd. Therefore, ab = 0 and, thus, a = 0 or b = O. Then, 
apply the result of Problem 3. 

7. By the sake of contradiction, write /2 = , with a and b relatively prime natural 
numbers, so that 2b” = a?. Then, successively show that a is even and b is even, 
thus reaching a contradiction. 

8. Adapt, to the present case, the hint given to the previous problem. 

Section 2.1 


. For item (a), it suffices to develop the right hand side to get 


-—y)@+y) =xa+y)-—yat+y) 
= +x) -(@y+y) 
=x -y., 
In what concerns (b), we have 
(ty) =@+y)@+y) =xe+y)ty@e+y) 
=V?+mtoytyY=xr +24 y’. 


Finally, item (c) is also obtained by expanding the right hand side, and we leave 
this to the reader. 


'The standard ways to prove a proposition of the form A => B (i.e., If A, then B) are either directly, 
by contraposition or by contradiction. In the first case, we assume the validity of assertion A and 
deduce the validity of assertion B directly; in the second case, we assume that assertion B is false 
and deduce, directly, that assertion A is also false; finally, in the third case, we assume that assertion 


A is true and assertion B is false and, from this, directly deduce a contradiction (i.e., deduce that an 


assertion that is obviously false should be true, which is something impossible to happen). For a 
detailed discussion on the fundamentals of Logic and methods of proof, we refer the reader to [21]. 
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. It suffices to notice that 


m n 2) ne? +n? + Pp’ 

—+—+ — = — 

np mp mn mnp 
_ (m+n py —2(mn + mp + np) 
7 mnp 


. The statement of the problem, together with item (a) of Proposition 2.1, gives 


us 
2 2 
(2) = (=) & (b(1 —a))? — (a1 — b))? = 0 
l-a 
& (b(1 — a) — a(1 —b))(bU — a) + a(1 — b)) = 0 
<> (b—a)(a+ b—2ab) = 0. 


However, since a # J, it follows that a + b = 2ab. Hence, we finally arrive at 


1 1 a+b 2ab _ 


— =2. 
a b ab ab 


. Successively applying items (b) and (a) of Proposition 2.1, we get 


—2 
i-G) 
(Vx— JV)? +299  (x-2Yayt+ y) +29 


a2 ey. Oe GTy 2-2 
~2@+y)  8@ty) 2 


. Applying item (a) of Proposition 2.1, we obtain 


(x3 a y a 2) = (x3 -y —_ 3) _ 


yrzZ 
_ We ty +2) Gy ONG? +9? +2) + @ - 9" 2) 
yrZ 
263 + 2°) +23 
~yte 
yrzZ 


= 40° —yzt7). 
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6. Sinceab=1 <0 + =bo i = a, we have 


(a-a) +5) _@-db+a _@&-Y 


a 
a? — b* a —b a —b 


=a 


where we have used item (a) of Proposition 2.1 in the next to last equality above. 
7. We have (y — x)(y + x) = 19°, with y — x and y + x integers such that 
0 < y—x < y+.x. Therefore, the only possible choice is to have y—x = 1| and 
y+x = 361, so that y = 181 andx = 180. 
8. A judicious application of item (b) of Proposition 2.1 gives 


a’ +b* = (a +b’) —2a’b? = [(a + b)? — 2ab]? — 2(ab)* 
= (m? — 2n)? — 2n? = m‘ — 4?n 4+ 2n’. 
9. Item (c) of Proposition 2.1 gives 
a + b8 = (a2)3 + (By = (2+ Bat — ab? + b+) 
= (at +b) —@b? = (2 +B)? 3b? 
= 1—3(ab)’, 
1=3(ab)? aS +6 


a®+b& ~~ a+b 


10. Start by observing that 


so that 


(ac + bd)? + (ad — bc)” = 
= (ac? + 2achd + bd”) + (ad? — 2adbc + b’c*) 
= AC 4P)4R(C 42) = (C4 P\(242). 
11. Adding 1 to both sides of the given equation, we get 
1? =121S=x+ytoaytl=(+)Dy+1). 


However, since x + 1,y + 1 > 1, the only possible choice is to have x + 1 = 
y+1=11. 

12. For item (a), it follows from item (a) of Proposition 2.1, with ./x and /y in 
place of x and y, respectively, that 


__ 
VEE Vy (VEE VI VEF V9) 


VEE SS _ VERT 


~ (fx)? - (Wy? x-y 
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13. 


14. 


15. 


16. 
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As for (b), we apply item (c) of Proposition 2.1, with ¥/x and 3/y in place of 
x and y, respectively, and observe that (3/x)? = Vx? and (2/x)? = x (and 
analogously for y). This way, we get 


_— V2 F afay + fy? 
VEL SI (Vat PSP F p+ VY) 
_ VP F pat VY 

x—y , 


Finally, item (c) follows immediately from (b). 
Item (a) of the previous problem gives 


(nt l)—n 2 res 
Jatl+ jn Vntlt+ Jn 2fn Jn 


The other inequality can be proved in a similar way. 
Applying twice item (a) of Problem 12, we obtain 


1 —  24+V2-v3_ 24+V2-v3 
DADs Ores aGae ° 3449 
_ 24+ ¥2- V¥3)3—4V2) 
- 32 — (4,/2)2 


= -50 + ¥2— V3)(3 —4V2). 


2(va+1- vn) =2 


First of all, note that, for all real x, we have °° + 3 = 2° + (/3)3 = («+ 
/3)(x7 — xs/3 + </9). Therefore, making x = /2, we get 


2V24+3 = (V24+ V3)(22- 72-73 + V9) 


and, hence, 
i. Ga2a4N9 
2+ 73 3 4 24/2 
_ 2-V2-V34 79 3-2V2 
34272 3 04/7, 
= 3—2V2)2-V2-734+ V9). 
It suffices to write y+ z = —x,x ++ z= —yandx+y = —z to realize that each 


of y+z,x+zandx-+ y is also nonzero. Now, making use of these equalities 
in (a), we obtain 
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x2 y 2 _ x2 y? 2 _ 5 
G+? @ty @+y Gy’ OF’ * 


Analogously, 7, ss + wor + =35 


te roe a 


17. Applying item . of Proposition 2.1 several times, we get 


= pb = (a af b*”) (a? = b*) 
= (a? + b**)(a'® 4 b\a"* =p 
= (a? + b*)(al® + 5!) (a8 + 58) (a8 — b8) 
= (a 4+ b**)(al® ae b!®)(a® Eis b®)(a* ae b*)(a* — b*) 
= (a? + D\fe* 4 b!%) an (a’ 4 b*)(a’ —b’) 
= (a? + b*)(a'? + bY)... +b )Y(a + b\(a—b), 
so that 


64 __ p64 


— (732 32 
CFD ET PETG BAH (FP Nat?) 


18. For item (a), it suffices to see that 


(a—b)(a" | +a" 7b+ a" ob +--+ ab"? 45") = 
= ala! + ab + a? 3p? +... 4 ab? +) 
=the 64a °F eso ey) 
=(a" +a" b+ at rp’ +---+a’b + ab™) 
~ (ab + a 2b? + a 3b +--+ ab! +.B”) 
=a"—b". 


With respect to (b), observe that the changes of sign make sense precisely 
because n is odd: 


(a+ bya" —a" b+ a FUP —---— ab"? 4+) = 
= ata"! — a" *b + a 3b? —---— ab”? + b) 
+ (a! — gp + a" 3p? —...— gb 2 + a) 
= aa b4¢? Paar + ab") 
(bea OP + OP a = a ED) 


=a'+b". 
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19. More generally, let us factorise x*” + 4y*”, where n € N, inspiring ourselves in 
the formula for (a + b)?: 


Af" 4 dy = 02")? + (2y")? 
= [(2")? + (2y2)? 4 2x2". 2y2"] — 2428. ay?" 
= (x" 4 2y")? — (2x"y"y? 
= ("4 Dy?" 4 2xv"y")(2" 4 2y2" — 2x"yNy, 
20. It follows from Example 2.7 that 
(atb+chH=G4h4+043(a+b\(at+c(b+o). 


Now, since (a+ b) + (a+c)+ (b+c) = 2(a+b+4 0), which is an even 
integer, we conclude that at least one of a+ b,a+ corb+c is also even, so 
that 3(a + b)(a+ c)(b + c) is a multiple of 6. Therefore, 


6|(a+tb+c) &6|(atb+c) 
& 6|fat+b+e43(at+b\(a+c(b+o)] 
&6|\(@+h +c’). 
21. We present two different solutions. For the first one, just note that 
at+tb+c=0 = (a+b)? = (-c)’ 
=> a+b + 3ab(at b) = -c’ 
=> a+b? + 3ab(—c) = —c? 
>a+b°?+ —3abe = 0. 
Another possibility is to use the result of Example 2.7: 
a+b+c=05 (a+b+c)?=0 
Ssa@+h4+43(atdb(a+o(b+c =0. 


Now, froma+b+c=0Owe geta+b=-c,a+c=-—b,b+c = —a and, 
then, 


3(a+ b)(a+c)(b +c) = 3(—c)(—b)(—a) = —3abc. 
22. Sincea+ Va? —b => a— Va? — b, both sides of the equality we wish to prove 


are nonnegative real numbers. Hence, it suffices to prove that their squares are 
equal, i.e., that 
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23. 


24. 


25. 


In view of the above, this is pretty clear. 
By the sake of contradiction, suppose that such x, y and z did exist. Then, 


writing = - 7 = i + 1 and performing the additions on both sides, we 
would easily conclude that x(x + y + z) = —yz, or, which is the same, that 
x? = —(xy + xz + yz). Similar reasoning would give us x7 = y? = 2 = 
—(xy + xz + yz). Now, from x? = y* we would get x = y or x = —y; however, 
since x + y # 0, we should have x = y. Analogously, we should also have 
x = z, so that the given equality would reduce to z = 2. This is obviously 
impossible. 
Making x = ie = — and z = a. we have 
+ xz + : + . + : 
xy +xz + yz = $+ +. ——— 
Pees 0-0-9 | b= Oa-b) C—ala-B) 
1 


a Cap Ga eee 


Now, if x,y,z are real numbers such that xy + xz + yz = 0, then we have 
ety? +2 =(x+y+4+ z)?. Therefore, in our case we have 


ee ee ets Omens 1 oe 
(b-—c)? ° (c—a)? (a—b) =~\b-c c-a ab) 


Finally, just note that — + — + — is a rational number. 
Initially, note that 


(Yat Vb =a+b4+3Vab(s/a + </b) 
and, hence, 


ah (a+ Vb) — (a+b) 


ate 
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From this, we conclude that 


a—b a—b 
3 Se) he =" - 
io (Ya? + Jab+ (Yop (Yat or — ab ~ 


Finally, 
fa= Flat V+ (Va VO €Q 


and, analogously, /b € Q. 
Raise both members of the equality a+c+d= —(b+e-+f) to the third power 
and apply the formula of Example 2.7 to get 


a+c?+d?+3(at+c)(atd)(c+d) = —(b +e +f?)—3(b+e)(b+f)(e+f). 


Now, since a? +b> +c? +d? +e3 +f? = 0, it follows from the above equality 
that 


(a+c)at+td)(c+d)=—-(b+eybt+fietf). 


Analogously, starting from a+e+f = —(b+c+d) and arguing as above, we 
find that 


(at eatflet+f)=—-b+ b+ ac +d). 
Finally, multiplying the two relations thus obtained and cancelling out the 
common factor (c + d)(e +f), we arrive at the desired relation. 
Start by observing that 

b<b+6ab4+1<b +60? +1<b'4+ 6b? 4+ 12b4+8 = (b+2)*. 

Therefore, if b> + 6ab+ lisa perfect cube, one has 

b+ 6ab+1= (b+ 1% =) 4+ 3b? + 3b41, 
so that 2a = b + 1. Substituting b = 2a — 1 into a* + 6ab + 1, we conclude 
that a? + 6a(2a— 1) + 1 = a? + 12a? — 6a + 1 is a perfect cube. However, it 
is immediate to verify that 

a<a+12a—6a+1< (a+ 4)’, 


so that the only possibilities are 


a’ + 12a’ —6a+1= (a+1)’, (a+ 2) or G4 3). 
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These possibilities give, respectively, the equations 9a7a — 9a = 0, 6a*— 
18a = 7 and 3a” — 33a = 26. The first one gives a = 1; the second has 
no integer solutions, for the left hand side is even, while the right hand one is 
odd; the third also has no integer solutions, for the left hand side is a multiple 
of 3, while the right hand one is not. 


Section 2.2 


1. For the first inequality, interpret |x — a| as the distance from x to a in the real line 
(alternatively, apply the definition of modulus, separately considering the cases 
x —a => 0 and x—a < 0). For the other three inequalities, adapt the hint given to 
the analysis of the first one. 

2. Separately analyse the cases (i) x, y => 0, (ii) x = 0 > y, (iii) y => 0 > x and (iv) 
x,y < 0, showing that equality occurs in all of them. 

3. For item (a), consider the cases x > O and x < 0. For item (b), argue as in 
Example 2.9. 

4. Separately analyse cases x < 0,0 < x < 1 and x > 1. The solution set is 
(—oo, 0) U (1, +00). 

5. Since |y| => y and | — y| = |y| for all real y, we have |x — a] > x — a and 
|x — b| = |b—x| => b—x. Therefore, in order for the equation to have a real root 
x, we must necessarily have 


c= |x-al+ |x-—b| => @—-a) 4+ (b-x) = b-a. 


Thus, a necessary condition for the existence of solutions is that c > b — a. 
In other words, if c < b — a, then the given equation will have no solutions. Let 
us see what happens if c > b— a (ie., let us see whether this condition also 
suffices to the existence of solutions). Since every real number «x satisfies one of 
x <a,a<x<borx > b, we shall analyse these cases separately: 


« x < a: then,x—a < 0 andx—b < 0 (why?), so that the given equation 
reduces to —(x— a) — (x—b) =c, whose root is x = 3(a +b—c). Verify that 
such a root indeed satisfies the condition x < a. 

* a<x <b: the given equation reduces to (x — a) — (b—x) = c, orb-—a=c. 
If such an equality is true, then every real number x satisfying a < x < b 
will be a solution of the equation; if it is false, then the equation will have no 
solutions x satisfying the inequalities a < x < b. 

¢ x > b: arguing as in the previous cases, we easily arrive at the solution 
x= $(a +b+c), which does satisfy condition x > b. 
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6. It suffices to see that 


r+1 /241 


2 1 
= - v3) = |—— +1- v3) = 
r+1 


r+1 
= | /2 —r| 
(r+ 1I)(V2 4+ 1) 


1 1 | 
1 
<5lr— 21, 


forr > Oand /2 > 1. 
7. For item (a), compute 4 — x1 + x. For item (b), start by using the result of (a), 


I+y 
together with the triangle inequality |a+b| < |a|+|b], to get paar < Taree 


8. For the first part, adapt the proof of Example 2.13. It may help you to notice that 


(n+ 1) +(n +2) +++ +2n)—(L+24+-2+4+n) =n? +14 24++++n)—(1 4 24++-4+n) =r’. 


For the second part, show that every x € (n,n + 1) is a root of the equation. 


Section 2.3 


1. If x =a is a root of the given equation, then a? + bla| +c = 0. But, then, we 
also have (—a)* + b| —a| +c =a? + bla| +c = 0, so that x = —a is also a 
root of the equation. Therefore, since aw + (—a) = 0, the sum of the roots has 
to be equal to 0. 

2. For item (a), squaring both sides of ./x + 2 = 10—x we getx +2 = (10—x)?, 
or x7 — 21x + 98 = 0. Since 7 + 14 = 21 and7- 14 = 98, the roots of this 
last equation are x = 7 and x = 14. However, only x = 7 satisfies the original 
equation, for, substituting x = 14 in it, we find 16 = —4, which is an absurd. 
In what concerns (b), we initially observe that, for the square roots to have a 
meaning in R, we must have -3 <x< 43 on the other hand, 

Vx+10= V2x+34+V1-3x 6 
& x+10 = (V2x+34+ V1 3x)? 
&x+10=4-x+2V7(2x4+ 3) — 3x) 


&x4+3= V(2x+4+ 3)(1 — 3x) 
> (x + 3)? = (2x + 3)(1 — 3x) 
7x7 + 13x+6=0, 


so thatx = —lorx = —§. Since both of these values belong to the interval 
[-3, +]. we conclude that they are the solutions of the given equation. Finally, 
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for item (c), we note that the substitution of variables y = x? + 18x transform 
the given equac¢ao into y + 30 = 2./y+ 45. Squaring both sides, we easily 
arrive at y* + 56y + 720 = 0, an equation whose roots are y = —36 and 
y = —20. From these, only y = —20 serves us, for y + 30 = 2,./y + 45 gives 
y+ 30> O0andy+ 45 = 0,ie., y > —30. Therefore, x? + 18x + 20 = 0 and, 
then, x = —9 + /61. 

3. Start by observing that, in order for “2x — 1 to have a meaning in the set of 
reals, we must have x > 5: By a similar reason, it must be the case that x > 


2x — 1. However, since 
x>V2x-1exr>2x-1 86 7-2x4+1>0, 


which is always true, the conditions for the existence of the square roots in the 
given equation reduce to x > 5: Now, 


2 
A= (vs+ V2x—-1+ yx—-vV2x-1 
= 2x+2y¥ a+ V2x—- I@- V2x- 1) 


= 2x + 2x? — (2x — 1) = 2x + 2|x—- 1]. 


We shall consider here only item (b), leaving the other two items as exercises to 
the reader. If A = 1, we have to solve the equation 2x + 2|x—1| = 1, under the 
condition x > 5. If x > 1, that equation reduces to 2x + 2(x— 1) = 1, so that 
x= 3 (which does not satisfy the condition x > 1). If 5 <x < 1, the equation 
reduces to 2x + 2(1 — x) = 1, an equality which is false for every x € (3. 1). 
Therefore, there are no solutions in item (b). 

4, Let ax* + bx + c = 0 be the equation of the first quiz, a'x* + bx +c = 0 be 
that of the second one and ax? + bx + c’ = 0 be that of the third one. Since 


the roots of the second quiz are 2 and 3, we have 4 = 5 and s = 6, so 
that —4 = 2; on the other hand, since the roots of the third quiz are 2 and —7, 
we have —4 = —5. Therefore, the second degree equation of the first quiz 


has the form ax” + 5ax — 6a = 0, and, thus, has the same roots as those of 
x + 5x—6=0,ie., —6 and 1. 

5. It follows from Proposition 2.16 that a + b = —a and ab = b. The second 
equality gives a = 1, and substituting this value into the first equality we get 
b= -2. 

6. Again by Proposition 2.16 (applied to both equations), we have —a = a? + 

2 = (a+ fp)? — 208 = 13-—2-9 = —5 and b = af? = (af)? = 9 = 81. 
Therefore, a + b = 86. 
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12. 
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The simplest possibility is x? — Sx + P = 0, where 


S=w+v=(utv)(W—w 4 v’) 
= —[(u+ v)* — 3uv] 
= —[(-1)’ - 3(-1)] = -4 


and P = wv? = (uv)? = (1)? = -1. 


. A possibility is the equation x? — S’x + P’ = 0, where S’ = (aS + P) + 


(BS + P) = (a+ B)S + 2P = S? + 2P and P’ = (aS + P)(BS + P) = 
apS? + (a+ B)SP + P? = 2S?P + P?. Here, we used the relations a + B = S 
and ap = P. 


. Ifa = 7+ 473 and B = 7—4¥V3, thena + 6 = 14 anda = 1, so that 


a and f are the roots of the second degree equation x* — 14x + 1 = 0. Hence, 
a? = 14a — 1 and B? = 146 — 1 and, starting from such equalities, we get 


qkt2 it pore = 14(a**! “its ee ee (aé ae B*). 


Now, make k successively equal to 0, 1, 2 and 3 to compute, also successively, 
a? + 67,03 + B?, a+ + Bt anda? + p°. 

Substituting x by a, we get the equality a? = a + 1 and, from it, «4 = (@?)? = 
(a+1)? =a?+2a+1 = 3a+2anda> =a-a* = a(3a+2) = 3a7+2a = 
3(a@ + 1) + 2a = 5a + 3. Hence, a? — 5a = 3. 

Since the roots of the equation must be integers whose product equals 5, we 
conclude that they must be (i) 1 and 5; or (ii) —1 and —5. In the first case 
m = —(1 +5) = —6, whereas, in the second, m = —((—1) + (—5)) = 6. 

The given equation is equivalent to x? — (b+ + 2a’)x + [bc + a’ (b+0)] = 0. 
In order for the roots of this last equation to be real and distinct, it suffices to 
show that A > 0, where A is its discriminant. In fact, since a 4 0, we have 


A = (b+ c+ 2a’)? —4[be + (b+ 0)] 
= (b+ c)? + 4a — 4bc = (b— 0)? + 4a* > 0. 
Making a = x — 1, we get vr—ax-1=0 (1) and, hence, + =x-a. 
Therefore, the original equation becomes x = fa + ./1— (x— a) or, which 
is the same, x — /a = /1 +a —x (2). Squaring both sides of (2), we get 
x*+a—2.fax =1+a—x, or x? — (2.,/a— 1)x— 1 = 0 (3). Subtracting (3) 
from (1), it follows that (2,/a— 1 —a)x = 0. Now, since x # 0, it must be that 


— . = T Abe -_ 14/5 
2./a —1—a = 0, from which a = 1. Then, x — - = 1, so thatx = =. 


However, (2) givesx-1=x—-—.fa= J/1+a—x > 0,80 thatx = 15 

If w is acommon root of both equations, then aa? —a?—a—(a+1) = 0 (*) and 
aa? —a—(a+1) = 0. Multiply the second equality by a to get aa*—a? —(a+ 1) 
a = 0, and subtract this result from (*) to arrive at (a+ l)a—a—(a+1) =0, 
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15. 


17. 


18. 


so thata = 1+ 4. Therefore, if the two given equations are to have a common 
root, this must be equal to | + i. It now suffices to verify that 1 + 4 actually is 
a root of both equations, and we leave this simple task to the reader. 

For item (a), it suffices to see that 


x — 3x7 + 5x = OP — 3x7 4+ 3x— 1) + (2x2) +3 
= (x— 1)? + 2-1) +3. 


Now, it follows from (a) that the equalities in the statement of the problem can 
be rewritten as 


(x— 1)? + 2-1) +3=1 and (V—1)?+2(y—1) +3 =5; 
adding these, we arrive at 
(x—1)? + (~-1% +2@4+y-2 =0. 
In the above relation, substitute the factorisation 
(@-1IP + 0-1? = +y-2[@-1?-@-DO-)+ 0-17] 
to get 
@t+y-2[@-1? -—@-D~y-1)+0-1) +2] =0. 
Now, there are two possibilities: x + y— 2 = 0 or (x— 1)? —(x—1)(Qv—- 1) + 


(y — 1)? + 2 = 0. In order to show that the second one doesn’t happen, make 
x—1=aandy-— 1 = band observe that 


be 2 3b? b\* 3b? 
a@ —ab+b?+2=a —ab+ —+—+2=(a--]) +—+2>0. 

4 4 2 4 
Compare the coefficients at both sides of the equality 

xe tar+bxte=(«-—al(’+d'x4+c) 

to get the equalities b’ — a = a,c’ — ab! = band —ac’ = c. Now, use the first 
and third relations to get b’ = a+aandc’ = ee showing, then, that these 
values for b’ and c’ also satisfy the relation c’ — wb’ = b (at this step you shall 
need to use the fact that a? + aw? + ba +c = 0. 
For item (a), let a = vos J/5 and b = V2 _ 5s so thata = a+ b. Then, 


use the fact that 


wo =a+b> 4+ 3ab(at+ b) =a +b + 3aba, 
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19. 


20. 


21. 


23. 


24. 


25: 


26. 


28. 


29. 


30. 
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together with the relation ab = —1. For (b), use the result of the previous 
problem to show that x* + 3x — 4 = (x— 1)(x* +x + 4), so that x = 1 is the 
only real root of x? + 3x—4 = 0. 

Use the result of Problem 21 to compare the coefficients of a3xX° + agx* +ayx+ 
ao = 0 and a3(x — x1) (x — x1) (x — x3). 

In order to compute the value of the first sum of powers, use relations (2.21), 
together with the identity a7 + 6? + y? =(a+ B+ y)*—2(aB + ay + By); 
for the other two sums of powers, start by observing that, since @ is a root of 
the equation, we have a* = 3a — 1, and analogously for B and y. Then add the 
left and right hand sides of these three identities. 

Expand the products in (y + d)* + a(y + d)? + b(y + d) +c and impose that 
the coefficient of y* equals 0. 

It suffices to see that 


Just note that 


First, note that the equality x” — x — 1 = 0 is equivalent to x — + = |. Then, use 
the identity 


(-2)((-2) 9) 
=([x-- Sete Moe fe 

x x 
The equality x?—4x+ 1 = 0 is equivalent to x+ + = 4. Now, use the expression 
for x + + deduced in the hint to Problem 23, and observe that x® + + — 
(x3 + a —2. 
Initially, consider the cases n = 2, 4 and 6; then, adapt the reasoning of these 
particular cases to the general one.) 
For item (a), see the hint to Problem 23. For item (b), argue as was done in the 
text for biquadratic equations. 
For item (a), let’s consider two cases: (i) if a,b > 0 thenc +d = —a < 0 


and cd = b > 0, so that c,d < 0; but then ef = d < 0, so thate > 0 > f. 
(ii) If a,b < 0, then, in the notations of (i), c > 0 > d. Therefore, starting 


B_ Hints and Solutions 531 


with the second equation, if necessary, we can assume from the beginning that 
a > 0 > b. Then, letting c > d be the roots of x7 + ax + b = 0, we get 
c+d= —-a < Oandcd = b < 0, so that c > O > d. For (b), we have 
A = nm’ — 4nand A’ = p? — 4q. Compute 


N= 


P= 
$ 
w+ 
DI 
aacane 
rm 
——— 
4 
w| | 
Dy 
Se 


= T(m? —2mVB + A+ 8m-+8V3) 


and notice that, since m > 0, 


A’ <A 6m —2m(VA—4) + (8VA — 3A) 
e —-J/A <m<3VA-8 &m<3VA-8 
(m+ 8)? < 9m? — 4n) 
 2(m— 1) > 9(n+ 2). 


For (c), note that the process cannot continue indefinitely if the inequality A’ < 
A always holds. Then, show that at some point we reach a situation in which 


n=—lorm = 1,n = —2. Ifn = —1, conclude that the roots of x7-+mx—1 = 0 
should be 1 and —1, so that m should be equal to 0, which is not the case. Then, 
we are left tom = 1 andn = —2. In this case, argue backwards to show that 
a=1,b=-2. 

Section 2.4 


N 


n 


. Execute the elimination algorithm in each of the items above. 

. Apply the elimination algorithm, in the way done in Example 2.26. 

. Apply the variable substitution a = 1, b=1l c= - to transform the given 
system into a linear system of three equations in three unknowns. 

. Show that xj = ayjby + arjb2 + a3jb3, for 1 <j< 3. 

. For item (a), it suffices to see that x} = x2 = --- = x, = 0 always is a solution. 
In what concerns (b), suppose that x; = a1, x2 = Q2,...,X%, = My, and x; = fj, 
Xo = Bo, ..., X,» = By are two distinct solutions of (2.25). If t € R is arbitrary, 
show that x; = ta; + (1 — t)8;, for 1 < i <n, is also a solution. Finally, for item 
(c), assume that the system has only one solution when b; = bp = --- = by = 0, 
so that this solution is x, = x» = --- = x, = 0. Let bj, bo, ..., bm be arbitrary 
real numbers, and x, = a1, X2 = Q2,...,X%, = Q@, and x; = fi, x = Bo, 

..5 X,» = By, be two solutions of (2.25). Then, it is immediate to check that 
XxX, = a — Bi, xX = a2 — Bo, ..., X» = Ay — By, is a solution of (2.25) when 
bj = by = --- = by = O, so that our assumption gives a; — 6; = 0, for 
l<i<n. 
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1. Equations x + yz = 2 and y+ xz = 2 give (x — y)(1 — z) = 0, so that either 
x = yorz = 1. Do the same for the other two pairs of equations, and compare 
the different possibilities that arise. 

2. Letting z= 1/(~+ y), we obtain the second degree system 


ztx=a+l 
wxw=a-1l : 


Hence, there are two possibilities: (z,x) = (a, 1) or (z,x) = (1, a). Those give 
rise to the systems 


-1 = 
y and or ai : 
1 x=a 
both of which can be immediately solved. 
3. Setu = = and v = ~~, and show that the given system is equivalent to the 


J xy 
second degree system 


utv=5 
uv =6" 
so that u = 2 and v = 3, or vice-versa. 
4. Add the three given equations and write the result as 


(axi + (b— Ixi +0) + (a5 + O- De t+ +(@5+O-Det+o=0. 


Then, apply the result of Lemma 2.14 to the second degree trinomial ax” + 
(b — 1)x + c and analyse separately itens (a) and (b). 

5. Suppose, without loss of generality, that x > y. Then, use the equations of the 
system to conclude that x = y = z. Finally, note that x = | is a root of the third 
degree polynomial equation x7 —2x-+1 = 0, and apply the result of Problem 21, 
page 43. 

6. Write the left hand side as a sum of squares and apply Lemma 2.31, withn = 3. 

7. Substituting y = +(a — z— 3x) into the first equation, we get 


1 
et ge 2-30? = 4z 


or, which is the same, 


25x? — 6x(a — z) + (a— 2)? — 64z = 0. 
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10. 


For this last equation to have a single solution, the discriminant of the second 
degree trinomial (in x) of the left hand side must be equal to 0, i.e., we should 
have 


36(a — z)* = 100[(a— z)? — 64z]. 


or, which is the same, (a — z)* = 100z. If this is so, then (from the second 
degree equation in x above) x = + (a —z) and (from the second equation of the 
system) y = x(a —z). Therefore, the system has a single solution if and only if 
the equation (in z) (a — z)* = 100z has a single solution. Since it is equivalent 
to 2? — 2z(a + 50) + a? = 0, its discriminant must also be equal to 0, i.e., we 
should have (a + 50)? = a. Thus, a = —25. 


. Start by writing 


3 3 
A oN = Gat lg er 
x= x— 


2 
zs —2Vx+2 


= 
(EY 


Then, apply Lemma 2.31 to conclude that it suffices to solve the system of 
equations 


2 2 2 
ap a 


so thatx=y=z= za BE) 


Letting a = Yx+5andb = V4—x, we geta+b =3anda+b = 9. 
Hence 


=(x-1)+ 


9=a4+b) =a? + B-a) =27-27at+ 9a 


or, which is the same, a2 — 3a + 2 = 0. Therefore, a = 1 or 2, from where 
x=—4orx=3. 

Argue as in the previous problem, letting y = /5 —x to transform the given 
equation into a system of equation in x and y. 


534 B_ Hints and Solutions 


11. First of all, observe that 
2 2 
2 x 2 1 
a {= 
* G4 al i) 
1 2 
(x+1)? x4+1 


=x 4+1+4+ 


(x? +2x+1)+ 


II 


1 
(x + 1)? x+1 


1 2 
= 1)? + ———. -2 1) - —— +2. 
(x+ 1) + G+ 1? (x + 1) ai + 
Now, make the variable substitution y = x + 1+ = to get 
ot 
(x + 1)? 


so that the given equation simplifies to the second degree equation 
y? — 2y—3 = 0, whose roots are 3 and —1. Then, we have 


1 
x+1+—=3 or —-l, 
x+1 


and each of these possibilities gives rise to a second degree equation in x. 
12. Firstly, observe that, if x = 0, then y = z = 0 (and analogously if y = 0 or 
z = 0). Therefore, we can assume that xyz # 0. Setting u = 1, v= and 


_ 1 
w = -—, we get 


1 1+ 4x 1 u- 
= +1=—+4+1. 


| — ee 
y 4x? 4x? 4 


Transforming the other two equations in analogous ways, we get the system 
below, which is equivalent to the original one: 


w+4=4v, v%+4=4y, w+4= 4u. 
Adding these three equations, we arrive at the equality 
(u—2)? + (v-2)7 + w-2P =0, 


and Lemma 2.31 gives u = v = w = 2 and, hence,x =y=z= 3. Thus, the 


solutions of the system arex = y= z=Oorx=y=z= 5. 
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13. 


14. 


15. 


16. 


17. 


Add the equations of the system to conclude that it suffices to solve the system 
of equations 


a+b+c=3d 
b+c+d=3a 
c+td+a=3b° 
d+at+b=3c 


Then, note thata = b= c =d=4(at+b+c+d). 

Write the given equation as (EZ; — F)) + (Eo — Fr) + --- + (Ey, — Fn) = 0, 
observing that FE; — F; => Oforl <i<n. 

For u € R, note that 


4° —ut = 4-— (4-407 +4) =4-2-w)’. 
Hence, 
6 = V4? -—x4 -3 4 V4y?-yt + V42-2° 45 


= V1-Q-#) + ¥4-@-y)? + V9-Q-2? 
V1 W/4p4/9 = 1494-3, 


and it follows from the previous problem that 2— x” = 2—y? =2-—2=0, 
ie.,x = +V2,y=4+V2,2= +2. 

Ifx > 4, write x4 =x+(3- +) to conclude that z > 4; then, argue similarly 
with the second equation to conclude that y > 4. Then, either x,y,z > 4 or 
x,y, Z < 4. In either case, add the three equations of the system to get 


4 x 
x = y = z 4 
and use the result of Problem 14. 
For item (a), write x + 2 2/2 = 1 “(x- af 2). For item (b), start by observing 
that x+ 2 = 2y gives that x and y are both positive or both negative; analogously, 
the same is true for y and z. Therefore, either x, y and z are all positive or 
all negative. Now, note that (x, y, z) is a solution of the system if and only if 
(—x, —y, —z) is also a solution, so that we can restrict to the case x,y,z > 0. 
Now, the result of item (a) gives 


2 
2y=x+—>2Vv2, 
x 
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so that y > 2; analogously, x,z > /2. Adding the three equations of the 
system, we get 

2 2 2 
X+yrZ=—-+r-+-. 
x y 2 


Finally, note that u > 2 for u > J2, and use the result of Problem 14. 


Section 3.1 


11. 


. For item (a), let a, = n, for odd n > 1, anda, = (—1)""!n, for evenn > 1. 


For (b), let a, = ort , forn > 1. Finally, for item (c), let a, = n, for oddn > 1, 


and a, = 1 forevenn > 1. 


— 1 _ 3agt+l _ i = 
. It suffices to note that bet ae oe 3+ os 3+ by. 
. For item (a), we can write a) = ay = a3 = 1 and ay = ay_) + ay_2 + ay_3, for 


k => 4. For item (b), we can write a; = | and a4; = 2%, fork > 1. 


. Observe that the entries in the n-th line form and AP of initial term n, common 


difference 2 and 2n — 1 terms. 


. Write 11... 1 as a sum of powers of 10, then use the formula for the sum of the 
—— 


n 
terms of a GP. 


. Use the result of the previous problem. 
. It suffices to show that by41 — by does not depend on k; the common value we 


shall obtain will be the common difference of the AP (b,)x>1. By definition, we 
have 


2 2 2 2 
biti — Dk = (Ay42 — M1) — (G41 — |) 


= (Ag42 — Ak41)(Ak+2 + Ae41) — (Ae+1 — Ak) (A411 + a) 


r(ag+2 + G41) — r(a+1 + ax) 


II 


= r(ax42 — ay) = r+ 2r = 2r°’. 


. Apply the formula for the general term of an AP to a, + dg and a, + dy. 
. Let a; = rm, with m € Z . By the formula for the general term of AP’s, we 


have 
a, +a, = 2a, + (K4+1-—VDr=a,t+ (m+k41—1)r = anyesi. 


The formula for the general term of AP’s gives us the relations a = a, = 
a+ (p—Dr, B = ag = a1 + (¢— Vrand apy, = a, + (p +g — l)r, where r 
is the common difference of the AP. By viewing the first two relations above as 
a linear system of equations in a, and r, express them in terms of a, 8, p and q; 
then, substitute these expressions for a, and r into the formula for ap+,. 
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12. Letting 2n — 1 denote the smallest of such integers and k their quantity, the 
formula for the general term of an AP assures that the biggest of the numbers 
equals (2n — 1) + 2(k — 1) = 2n + 2k — 3. Therefore, their sum is equal to 
5[(2n — 1) + (Qn + 2k — 3)|k = 7°, so that (2n + k — 2)k = 7°. Now, since 
2n+k—2>k> 1, the only possibility is 2n + k —2 = 7? andk = 7, so that 
n= 22. 

13. Let r be the common difference of the AP and let ay = m, so that r € N and 
m > 1. Then, amx+2 = a2 + ((mk + 2) — 2)r = m+ mkr = m(1 + kr), which 
is clearly a composite number. 

14. Item (b) follows immediately from (a). For item (a), separately consider the 
cases n even and n odd. 

15. By contradiction, suppose that a, = /2, a, = J/3 and ap = 5, where 
(ax)¢>1 is an AP and m, n and p are pairwise distinct natural numbers. Then, 
use the formula for the general term of AP’s to compute a a 

16. Apply, to the left hand side, the formula for the general term of GP’s. 

17. Adapt, to the present case, the solution given to the Example 3.14. 

18. Adapt, to the present case, the solution given to the Example 3.14. 

19. Suppose that there exists such an AP (axz),>1, of common difference r, i.e., 
such that a & Q but aw = Amp, for some distinct indices m, n and p. Use the 
formula for the general term of an AP to reach a contradiction. 

20. Show that the sequence (b;)x>1, such that b, = a, — 1, is a GP. 

22. Suppose that (a;)x>1 is a GP of common ratio g, such that a, = 2, a, = 3 and 
ay = 5. Then, aig”! = 2, aig"! = 3 and ajq?—! = 5, so that g”" = = and 
ge 2, Therefore, 

(=) af = gre? = (3) 
3 5 
or, which is the same, 2”? - 5" = 3"? Now, get a contradiction. 

Section 3.2 

1. Apply the result of Theorem 3.16. 

2. Apply the result of Theorem 3.16. 


. Start by observing that ay42 — ag4) = (2ag+ ) — 1) — (Zag — 1) = Zags) — Zax 


and, hence, that az42 — 3az41 + 2a, = 0, for every integer k > 1. Now, apply 
the result of Theorem 3.16. 


. Use the fact that x? + rx + 5 = (x—@)’. 
. For item (a), apply Theorem 3.16. For item (b), use the formula of (a). Finally, 


for (c), use the formulas of item (a) and (3.10). 
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8. 


For item (a), the given recurrence relation furnishes 
Abe. , =4(1 + 24an41) = 44+ 6(1 + day + V1 + 24) 
=446(1+ z(Hf=1) + bx) 
= be + 6b, + 9 = (he + 3)’. 
For item (b), it follows from (a) that 2b;42 — baz) = 3 = 2by41 — by and, thus, 
2bn42 —3bn41 +b, = 0. Since b} = 5 and by = 1 + 24a = 4, Theorem 3.16 


guarantees that b, = 3 + = At last, for item (c), it follows from items (a) and 
(b) that 


= My ay = a((2+#) -)) 
24°" 24 2 
ald) (o-4 
24 2P 2 
=3(1+ )(+5). 
3 Qn-1 Qn 


= 
| 


Sections 3.3 


NN BWLD 


. If (ax)xe1 is a second order AP, then (az4+1 — ag)x>1 iS a non constant AP, so 


that there exists r 4 O for which (ag42 — ap41) — (Ax+1 — Gx) = r, for every 
k > 1. In particular, for every k > 1, we have ag42 — 2ax4, + a, # 0 and 


(443 — Ak+2) — (Ak+2 — Ge+1) = (de+2 — Akt1) — (Ak-+1 — ak). 


The converse can be prove in an analogous way. 


. Write ay41 — a, = 3k — 1 and use telescoping sums. 
. Write ay41 — ay = 8k and use telescoping sums. 
. Observe that 1 = i and use telescoping sums. 


(Dk ~ k-1 


. The previous problem is a particular case of the present one. 
. Start by noting that 


1 1 
RE < © 


for every integer k > 1; then, apply the result 
of Problem 4. 


. Adapt to the present case the hint to Problem 4, writing 


1 i 1 1 
(4k—1)(4k+3) 4\4k-1 4k43/)° 
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8. 


10. 


11. 


12. 


13. 


14. 


For item (a), it suffices to observe that (2k + 1)? — (2k —1)? = 24K7+2 = 
16k* + 8k? + 2 = 16k” + (2k — 1)? + (2k + 1)?. For item (b), start by using 
the formula for telescoping sums to write 


(2n+ 1)?-2 = [2k + 1)? — 2k 1)*] + (3° - 2) 
k=2 


= S [(4k)? + (2k + 1)? + 2k = 1)*] +25 
k=2 


S14? + (2k + 1)? + (2k = 17] + 4 + 3?. 
k=2 


Then, observe that the last expression above is a sum of (n—1)-3+2 = 3n—1 
perfect squares. 


. Note that ean = Sratia ="- es and use telescoping sums. 
For item (a), observe that (k+1)? +k? +47(k4-1)? = k44+-247+3k?+2k+1 = 


(k* + k + 1)*. In what concerns (b), it follows from (a) that 


1 1 (+k4+1) R+k4+1 
— + —— +1 = || ——— = ——_. 
ke (k+1)? K(k + 1)? k(k + 1) 


It now suffices to write 


R+k+1 1 1 1 
lle ae 1 — ef Se 
k(k + 1) k(k + 1) k k+1 
and use the formula for telescoping sums or the result of Problem 4. 
: Fx _— Froo-Fk 1 1 . 
Write ae = aes = Fi 7 Fa and, then, use the formula for telescoping 
sums. 


Rationalise the fraction —see Problem 12, page 25—and, then, use 


1 
Vart Jap 
the formula for telescoping sums. 

For item (a), just note that x* +27 +1 = (x4 +2x7 + 1I)—x* = °° +1)?-2? = 


(x? + 1 —x)(x? + 1 + x). For (b), use the result of (a) to write 
k = k 
M+e+1 (P@+k+ 1)? —k+1) 
a 1 1 
2B Ak+1 P+k+1)" 
Then, use the formula for telescoping sums. 


Rationalise the fraction under the )* notation—see Problem 12, page 25—and, 
then, use the formula for telescoping sums. 
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15. Adapt, to the present case, the hint given to the previous problem. 
16. Initially, observe that 


j=2 j=2 
A Ger 0 Ge 


Now, take a, = k for k > 1. It follows from Proposition 3.29 that 


N(-7) NCU) 


_ a Gn41 _ I nt+1_ n+l 


GQ, a n 2, 2n 


17. If S is the sum we wish to compute, then 


101 3 
> 


Since 1 — x, = 1-4 


01 — *101-k, We get 


101 3 


oe, a ee 
= eo Bae 
k=0 *101—-k T Xk 
and can write 
101 3 101 3 
i ee ee AE 
- x + Pod x + x? 
k=0 101k k aq *101-k k 
101 3 101 
= Xj *101-k 
~~ 3 3 3 3 
tao tik XK ay More FX 


101 101 


3 3 
Xp +X 
_ k 101-k __ = 
=) = 4s y Tie. 
k=0 101—k k k=0 


Hence, S = 51. 

18. Start by factorising both the numerator and denominator of the fraction under 
the [] notation. If a, = k? —k + 1, show that ax4, = k* +k + 1 and, then, use 
the formula for telescoping products. 
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19. We first show that n must be even. To this end, suppose that A = A; U---UAp, 
with Aj, ..., A, satisfying the conditions of the problem. For 1 < k < n, let 
xx € Ax be the element of A; that equals the arithmetic mean of the other three. 
Then 4x, = Yo i< 4, x and it follows that 


‘Yas ioe oe. -+4n = 2n(4n + 1). 


k=1 x€Ag x€A 


Hence, n(4n + 1) must be even and, since 4n + 1 is odd, it follows that n must 
be even. Now, let n = 2k, with k € N, and write 


{1,2,3,..., 8k} =A) U---UAdg, 
where 
Aj-1 = 8G—1) + {1,3,4,8} and As; = 8G — 1) + {2,5,6, 7}. 
(Here, for X C R, we define the set X¥ + tby X +t = {xn +14; x € X}). Since 
3-4=1+4+3+4+8and3-5=2+46+ 7, it is immediate to verify that the sets 


A;, defined as above, satisfy the conditions of the problem. 
20. By the result of Problem 13, page 26, we have 


(Va¥1- Va) Va = Vt tn— Vn 


iy 1 
< n+-o- —-vVn2= — 
5 Py 


and, thus, 2(./n + 1—./n) < ie Analogously, we get 7 < 2(./n—Vn— 1). 


Taking these two inequalities together, we get 


10000 


y (VEFT~ vi) < YG < ¥ (ve-V=T). 


= A 
However, 
10000 
(Vet T- vk) = 210001 — 2V2 
k=2 
and 


Cee ean = 2 (V10000 - 1) = 198. 


k=2 
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Hence, if we show that 2.710001 — 2/2 > 197, it will come that the greatest 
integer which is less than or equal to S equals 198. To what is left to do, just see 
that 


2/10001 > 2V2 +197 & (2v 10001) = (2v2+ 197) 


& 40004 > 788/2 + 38817 


& 788V2 < 1187 & 7887-2 < 1187 
€ 1241888 < 1408969, 


which is indeed true. 
21. For item (a), factorising (k + 1)?t! — kPt! with the aid of the result of 
Problem 18, page 26, we get 


p+l 
(k+ Prt — ett = Sk + pert Re 


j=l 
pti 


< okt yPettes 1 


j=l 
= (p+ IkK+ 1). 
Analogously, 


ptl 
(k+ 1p) — pti — Yokt+ ppt lip! 


j=l 
pti 


> ete! = ot Dk. 
For item (b), it follows from (a) and from the formula for telescoping sums that 
nl n-l +1 _ pptl p+1 p+1 
ye < —— ke _W se aL nt 
p+l pt+l ptl 
and 
perl (k _ 1)! nptl 


re ag ee 


k=1 
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Section 4.1 


1. It follows from the induction hypothesis that 


k(k +1) (k+ Ik +2) 
—— a 


14+24+---+k+(k+)= +(kK+1)= 5) 


2. It follows from the induction hypothesis that 


k 


ae (Me Dy +04 1) = ( para’ 
; | 


2 


j=l 


3. It follows from the induction hypothesis that 


1 1 1 1 1 
1—--+-—-—.---4+ —— — — + ——_ = 

2° 3 Sea: We” Bee 1 

ot 1 She 1 as 1 

— k k+d MA—-1 2k 2k+1 

=. il reer 1 dh 1 

a. MR—-1 2k WKk+1° 


4. By induction hypothesis, we have 
(k+ 1) +h) + A(2) + A(3) + +--+ h(k— 1) + h(k) = 14+ kh(k) + A(k), 
Therefore, it suffices to prove that 1 + (k + 1)h(k) = (K+ 1)h(k + 1) or, which 
is the same, that (k + 1)(h(k + 1) — h(k)) = 1. However, 


(k + 1)(h(k + 1) —A(Q) = +1) =I 


5. It follows from the induction hypothesis that 


II 


k 
YiGt+) i(k I)k(k + 1) + k(k +1) 
j=l 


= she tS 2): 


6. It follows from the induction hypothesis that 


k+1 
YS Q- 1)? = $k = 1)2k(2k + 1) + (2k + ibe 
j=l 


II 


502k + 1)2(k + 1)(2k + 3). 


544 


10. 


11. 


12. 


13. 
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. In trying to appeal to the induction hypothesis, we observe that 4*+! + 


15(k+1)—1 = (444+ 15k—1) +3(44 +5). Then, since 4* + 15k—1 is a multiple 
of 9 (by that induction hypothesis), it suffices to show that 4* + 5 is a multiple 
of 3. To this end, make another proof by induction: 4’F! +5 = (4'+5)+3-4!; 
since 4’ + 5 (by the new induction hypothesis) and 3 - 4! are multiples of 3, it 
follows that 4't! + 5 is also a multiple of 3. 


. Appealing to the induction hypothesis, we observe that (k + 1)? — (k+ 1) = 


(k? — k) + 3(k* + k). Now, since k* — k (by that induction hypothesis) and 
3(k? + k) are multiples of 3, it comes that (k + 1)? — (k + 1) is also a multiple 
of 3. 

If 43 ' — 1 = 3*g, for some gq € N, then 


1a p-1s @ -ye ps $Y 
= 3g (4)? — 1) + 4" = 1) 43] 
= 3g((4" — 14" +1) + 3*¢ +3] 
= 3g3'q(4® | +1) +3'g+ 3] 
= 3 gpsktg(4e + 1) + 3g + I]. 


Suppose we have n; ways of choosing an object of type 1, nz ways of choosing 
an object of type 2, ..., n, ways of choosing an object of type k, ng+1 ways of 
choosing an object of type k + 1. In order to choose an object of each of these 
k + 1 types, start by choosing an object of each of the types from 1 to k. By 
induction hypothesis, there are exactly njnz...nx possible distinct choices. On 
the other hand, for each of these choices, we have ng+; ways of choosing an 
object of type k + 1. Therefore, the number of ways of choosing an object of 
each of the types from 1 to k + 1 is equal to 


NyN2...Nk+NN2...Ng ters $NjN2...Ng = NN... NENK+1- 
————— ee, 


n+ times 


By induction hypothesis, suppose that any set with k elements has exactly 2‘ 
subsets. If A = {aj,d0,..., 4%, 441} is a set with k + 1 elements. It has two 
distinct kinds of subsets: those which are subsets of B = {a,d2,...,a,} and 
those which contain a;,4 . By the induction hypothesis, A has exactly 2* subsets 
of the first type. On the other hand, the subsets of the second type are of the form 
B’ U {ax41}, where B’ is a subset of B. Again by induction hypothesis, there are 
also 2* subsets of this second type. Hence, A has 2* + 2* = 2**! subsets. 

To the first case, by using the induction hypothesis (strong induction!), we get 
UR+2 = —TUg+1 — SUR = —AR+1 — SAK = Ap+2. The second case is completely 
analogous. 
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14. By induction hypothesis, suppose that (a), a2,...,a,) is an AP of common 
difference r. Then, 


k k-1 


k 


Qak+1 


1 


+ 
ajaj+ = ajaj+1 Qkak+1 a1 ak Dpak+1 


1 1 k-1 1 
+ 


j=l 
and, hence, kay = (k — 1)ay4, + a;. However, a, = a, + (k — 1)r, so that 
(k= Vapy = Kar + k— Dr) — ay = K— 1a + er). 


Therefore, ay4, = a, + kr, and (a), ao,..., 4, 4k+1) is also an AP. 
15. For item (a), the induction hypothesis gives 


Ak+1 =a,-a+1=a(a,—1) +1 


=a: (a1...ae-1) HL =a... ar-1a, + 1. 


For (b), it follows from the induction hypothesis and the result of (a) that 


k+1 k 
1 1 1 1 1 
j aj a aja2...a a 
j=l qj pre k+1 142 k k+1 
Agn+1 — a\a2...ag 1 
a0 ee SS 
{a2 ...AKAK+1 Q1Q2...AKAk+1 


16. By induction hypothesis, we have 2”°*' = (22)? > k, so that it suffices to 


k 
show that k* > (k + 1)**! or, which is the same, (5) >k+1. Such an 
inequality is true for every integer k > 3, for we have 


ke \* i ae 
(—) =(k-14 75) > (k-1Ik¥ > k-DPEk41. 


17. Forn € Z, let a, = x" + x". We first show that a, € Z for every n € N. The 
hypothesis of the problem gives a; € Z, so that 


2 1 1\* 2 
ay=x+5=(x4+-) -2=a,-2€Z. 
x x 


Now suppose, by induction hypothesis, that aj,a2,...,a, € Z, for a certain 
integer k > 2. Then, 
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18. 


19. 


20. 


21. 


22 
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ll 
ne 
7 
+ 
) 
Ne” 
3 
+ 
Sle 
wae 
| 
So TS 
‘ 
i] 
VSS 


= aga, — Ag-1 € Z, 


and the induction step is complete. To what was left to show, it suffices to see 
that ag = 2 and, if n < 0 is an integer, then a, = a_, € Z. 
It follows from induction hypothesis that 

ied = (f — 3x)? = x8 - 6x4 + 9x7 


= (yx + 2)? — 6(y~ + 2)? + Oye + 2) 


= ye — 3ye +2 = Neg +2. 


It suffices to prove that the quotient Sot Pend does not depend on n. This is 
immediate by induction, for 


2: 2 
Xnt2FXn — Xnt1 + Xn-1 _ Xnt2Xn +X, Xp T Xn—1Xn+1 
Xn+1 Xn Xn+1Xn 
2 2: 2 2, 
_ n+ = 1) +X, — Xn4+1 (x, = 1) - 


Xn+1Xn 


0. 


For item (a) we have, by induction hypothesis, Fy + Fy +--+ + Fy + Fre4i = 
(Fet2 — 1) + Frey. = Fe43 — 1. For (b) (and also by induction hypothesis), 


Fi + F; teeet Fe + Bey = PF epi + rae = FyyiFe+2. 


Items (c), (d) and (e) can be dealt with in an analogous way. 
We make induction on n > 1. For n = 1, the relation in the statement of the 
problem is simply the recursive definition of F’,,4,. In general, it follows from 
the induction hypothesis that 
Pinpktt = Pinte + Fin+k-1 

= (Fini + Fm Fa) + (FinF a + Fin-1F-1) 

= Fy (Freoi + Fe) + Fm—-i (Fe + Fe-1) 

= FinFpi + Fm-1F +1. 


In both items, make induction on n, for a fixed m > 1. For the initial cases 
2Fingt = Fin + Ly and 2Ln41 = S5Fin + Lm, you will also have to make 
inductive arguments. 
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23. 


24. 


25. 


26. 


For (a), observe that F} = 1, Fo = 1, F3 = 2, Fy = 3, Fs = 5, Fo = 8, 
Fy = 13, Fg = 21, Fo = 34, Fio = 55, Fry = 89, Fig = 144, Fi3 = 233, 
Fis = 377. Therefore, F) = 12, Fyo = 12? and Fy; > 132, Fi4 > 142. On 
the other hand, if Fy > k? and Fyi, > (k + 1)*, then Fys2 = Fe + Fra > 
ke + (k +1)? > (k + 2)* for every k > 3. Hence, we get by induction that 
F,, > n? for every n > 13. For the first part of item (b), note that a is a root of 
the second degree equation x? — x — 1 = 0, so that a* = @ + 1. Thus, we have 
by induction that 


ait! = ak .qga (Fra + Fr-\)a = Fro? + Fp-10 
= Fi(a+1)+ Fra = (Fy + Fy-1)a + Fy 
= Fria + Fr, 
Now, note that a” — n?a = F,a + F,-; —n’a = (F, —n*)a + F,_1. Since a 
is irrational, it follows from this equality that w” — n?@ will be an integer if and 
only if F,, — n* = 0. Hence, the solution of ii. follows from that of the item (a). 
Use induction to construct a subsequence (dy, )x>1 Of (Gn)n>1 Such that ay, > 


Nk, A1,€2,...,Anj_y- 
If a; = j for 1 <j < k, then we get from 


ay +a +--+ + ap + apy = (ay +n ++ + ae + ay)? 
that 
3 3 3 3 _ 2 
P+ 2° +--+ +a, = (+245 +k + apy). 


According to the result of Problem 2, this last relation gives 


1 1 
glk + DP + ay, = (See +1)+ asi) ; 


Performing the computations at the right hand side, we arrive at a? 41 = 4ayt 
k(k+ 1)ax41 or, which is the same, Gea: —ag41—k(K+1) = 0. Since az; > 0, 
it immediately follows that a4; = k+ 1. 

If x, > ./a, then 


sisi Vi = 3 (me - 28+ 2) = Dt Ja)’ > 0. 


To the other inequality, we start by observing that, if x > y > /a, then x + = 
y+ _ (*); in fact, 
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2d: 


28. 
29. 
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(x+ *)— (y+ “) =-y+a(2--) = Foy a) >0, 
x y x y 


for xy < Ja = a. Now, if x, < fat ser: then, from (*), we get 


ee eg ee ee 
SSX —_ — —. eee 
acuee | Gla) fa) Wee uae 2 


1 1 a 1 
If k > 6 and ay, a2, ..., dx are positive integers such that 
ee rae 
a a; gq 
then 
Eee et 1 a 1 Sed. 1 eae 
4° 4° 4° (2a;)2 © (2ay)? (Qa)? 4°44 ~~ 


Hence, by a simple variant of the strong form of the principle of mathematical 
induction, it suffices to consider the cases k = 6,k = 7 andk = 8. Before we 
look at them, note that 


Pegs ae 1 4 hae g 1 

~+-+——=-— and —-+-4+-4-<=1. 

9 9 36 4 4 4 4 4 

Therefore, fork = 6,wehave++44+i4+4+4+24 = | fork = 
1 1 1 1 1 1 Te a L- 

Pie PONG eg ag ae Ps, og ay, rr Oe By OES 

atagtgtgtetoty ta =! 


Argue as in the proof of Example 4.12. 

For the existence of a representation for n, if (k + 1)! <n < (K+ 2)!, let ay 
be the greatest natural number such that ay4,(k+1)! <n < (agg, +1) (K4+1)!. 
Then, the inequalities ay4,(k+1)! <n < (k+2)! = (kK+2)-(k+ 1)! guarantee 
that ay41; < k+ 1; on the other hand, since 


n— ape lk +1)! < (aeg1 + D(A + Dl api + D!= (K+ YD! 
it follows from the induction hypothesis that 
n—agyi(kK+ I)! =a,- I +a)-2!+---+aq-kl, 
for certain integers a1, dz, ..., ax, such that 0 < a; <j, for 1 <j < k. For the 
uniqueness of the representation of n, suppose we have established it for every 


natural n such that n < (k+ 1)!. Taken € N such that (k+ 1)! <n < (k4+ 2)!, 
and assume that 


B 


30. 
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n= ay: 1) +az-2!4+---+ agg (KAD)! = by V4 bp 24+ --- + dg (KAD, 
with 0 < a,b; <j for every j = 1. Then, by the result of Example 3.26, we get 


Oy (K+! <n = by - U4 b2- 24+ + deg - (K+ 2)! 
Stell 4 259! 464 Re + Bags + 1 
=i Fh DD! 1< Gee tl) e+ 1) 
(B.1) 


so that ap4,; < by+4 1. Analogously, by) < ay, and, thus, ag4) = dg41. 
Therefore, we have 


n— agi (K+ 1)! = aye +a: 2! +--+ +ag-k! = by M+ 2-2! +--+ dg kl, 
with (again by the result of Example 3.26) 
n—-aeyi(KEDIS 1-1 4+2-2!4+---+k eM = (K+D!I-1. 


It thus follows from the induction hypothesis that a; = b;, for 1 <j < k. 

For the existence of a representation as asked by the problem assume, as 
induction hypothesis, that for some natural n > 3 every m < F,, can be uniquely 
written as asked. Now, take a natural m such that F,, < m < Fy41.Ifm = Fy+1, 
there is nothing to do. Otherwise, 0 < m— Fy, < Fn41— Fn = Frei < Fh. 
Therefore, by induction hypothesis, there exist an integer r > 1 and 
nonconsecutive indices 1 < ty) <---<?, <nsuchthatm—F, = F;, +-:-+F.. 
Then, m = F,, +-+-+F;, +F ny, and we claim that n—t, > 1. In fact, ifn—t, = 1, 
we would have m > F, + F,;, = Fy + Fr-1 = Fn+1, thus contradicting the 
fact that m < F,,4,. For the uniqueness of representation assume, as induction 
hypothesis, that for some natural n > 3 every natural m < F,, can be uniquely 
represented as asked. Now, take m € N such that F,, < m < F,,41, and consider 
two cases separately: 


(a) m= F,,+1: there are two subcases. First, suppose that F,41 = Fi, + Fi, + 
s+ F,, with r > lil<t<th <<: <t, < nand ti41 — tj > 1 for 
1 <j <r. Then, 
Foor = Fy t+ Fy te +, < Pui + Py + Py ++ + Fi, 
= Fy+1 + Fi, tee t+ Fy, < Fi-1 + Fi, + Fi, +++ F,, 
— Fry 41 + Fi, tet Fi, < Fi-1 + Fi, +++ F, 


< Fi41 <F, < Foti, 
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which is an absurd. Now, if Fn4i = Fi, +++: + Fi, + Fn, then 
Fi, +e + F;, = ntl — Fn = Fa-1 


and, by the uniqueness of the representation of F,-1, we would conclude 
that any other representation of F',4; would be Fy41 = Fn + Fp—1, which 
does not satisfy the conditions of the statement of the problem. 

(b) F, < m < F,+1: let us show that F,, is necessarily one of the summands 
of the representation. In fact, ifm = F,, +--+: + F;,, with t, <n, then a 
reasoning analogous to that of the first subcase of item (a) would give us the 
absurd conclusion m < F,,. Therefore, for a representation of m as in the 
statement of the problem, we should surely have m = F;, +-+-+ Fy, + Fn, 
with t, <n—1. Thus, since m—F, < Fy41—F, = Fy-1, the uniqueness of 
the representation of m follows from the uniqueness of the representation 


of m— Fy. 

Section 4.2 

1. It suffices to note that () = 2a rh) = a) 

2. Adapt, to the present case, the idea of the solution of Example 4.18. 

3. Compare (/) to (,/,) using the definition of (‘). 

4. Make a proof by induction on n. 

5. Apply Stiefel’s relation to the numerator of the fraction and, then, use telescoping 
sums. 

6. Make a proof by induction, applying Stiefel’s relation in the induction step. 

7. Let a, be defined by the expression at the right hand side of the equality we wish 
to establish. Show that a; = 1, a2 = 1 and (using Stiefel’s relation) a,42 = 
An+1 + dn, for every integer n > 1. 

8. Make induction on n; more precisely, show that, for each n € N, the set 
{1,2,...,m} has (7) subsets of k elements each, for every 0 < k < n. For the 
induction step, note that each of the subsets of k elements of {1,2,...,2 + 1} is 
of one of two possible types: those not containing n + 1 — thus, being subsets of k 
elements of {1,2,...,}-— and those containing n + 1 — thus, being equal to sets 
of the type A U {n + 1}, where A is a subset of k — 1 elements of {1,2,...,n}. 

Section 4.3 


1. Use Newton’s binomial formula to expand 4” = (1 + 3)”. 
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1S) 


. It follows from the previous problem that A = 4”. Moreover, by an analogous 


: . 8 
reasoning to that of the previous problem, we get B = 12”—'. Therefore, + = 
B j2n-l gn 


5 =p = 77 Sothatn = 9. 


. Compute the first three terms in the expansion of (1, 1)” = (1 + it)’: 
. The sum we wish to compute equals 


1 11! 11! 11! 11! 11! 11! 
11! (10! 398! St6! 74! 9121 1110! 


All) G)-G) G6) -G) 


at gapped 
11! : 


. The general term of such expansion is (°°) x Now, observe that 


oe Ea a : o : < ! 8 65 k>3k+)ek< 15 
k ] 3k k +1) 3k+1 65—-—k k+1 3 =~, 


Analogously, (C)a > (°) set <> k > 16, so that the maximal term is 


((o) 3 


. The given condition is equivalent to a” + b? + ab = c? + d? + cd. On the other 


hand, it follows from the binomial expansion that a* + b+ (a+ b)* =ct+dt*+ 
(c+d)‘ if and only if at +. b+ +. 2a°b + 3a*b? + 2ab? = c44+d*4+23d4+3c7d? + 
2cd?. Finally, observe that a* + b+ + 2a*b + 3ab* + 2ab? = (a? + b* + ab)? 
and, analogously,c* + d* + 2c3d + 3c?d? + 2cd? = (c? + d* + cd)’. 


. Use the result of Problem 3, page 103—with 2n in place of n—together with 


the theorem on the lines of the Pascal triangle. 


. Adapt, to the present case, the idea of the solution of Example 4.27. 
. It suffices to see that 


» (")i! = rr a =an(1+a)"! 
j=l 


J=0 


and 


=n(n—1)-2"7 47-27), 
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10. 


11. 
12. 


13. 
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Use the formula for the general term of an AP, together with the result of the 
previous problem. 
Write 7 = 1 +a, with a > 0, and use the fact that (1 + a)” > 1 + na. 


Making a = /a + */b, we get (a — /a)" = b. Now, arguing as in the proof 
of the Example 4.23, we conclude that 


(a — Ja)" = (a + agra"? + ap_got”* 4 ++) + 
+ (a,-10""! + dy_3a"* +--+) Sa, 
for certain integers do, 41, 42, ..., An—2, An—1. Hence, 
[(a” + a,—p00"-? + ay—goe"* 4+ ---) — bP? = 
= a(a,_\a" | + a,30" 7 +---)?, 
so that @ is the root of the polynomial equation of degree 2n 
[(X" + dn—2x"? + y—4x"™* + +++) — BP 
= aaa |) baa? ++) =O. 


First of all, let’s find out which is the (constant) value of the given expression. 
To this end, substituting x = 2 and y = z = —1 (observe that x+ y+z = 0), we 


5 eligd 
get? +y +2 = 30 and xyz(xy + yz + zx) = —6, so that SEES Teer =—5 
S) Peele) 
Therefore, we ought to prove that eS = —5, for all nonzero reals x, y 


and z such that x + y + z = 0. To this end, note that 


rey te =P+yt+(—x-yp=r+y-—(@+y) 


II 


¥ +y — (OO + Sxty + 10x*y? + 10x7y? + Sxy* + y°) 


II 


—(Sx4y + 5xy* + 10x°y? + 10x73) 

= —Sxy[@? + y’) + 2xye + y)] 

= —Sxy(x + y)[(@? — xy + y*) + 2xy] 

= —5xy(—z)[(x? + 2xy + y’) — 39] 

= Sxyz[(x + y)(x + y) — xy] = Sxyz[@ + y)(—z) — xy] 
= —Sxyz(xy + yz + zx). 
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14. Applying Newton’s expansion formula twice, we get 


Gps = Se () (+ yr ie! 


1=0 


EEC )e 


E(u 


1=0 k=0 


Now, observe that (7) () = ae so that it suffices to make j = n—k—1 


to get the formula in the statement of the problem. 
16. The general term in the expansion of (1 +x+ ae is (ie) uxt ay = 
(dx, with j + k + 1 = 10. Therefore, we should have k + 1 = 10 —j 


and k —/] = 0, so thatk = 1]=5— £ and, hence, j must be even. Now, make 
j successively equal to 0, 2, 4, 6, 8 and 10 and add all of the corresponding 
10 

summands (, ae i). 

17. Use the nomial J pansion formula to get D0 j4.41=n ) = (14+1+41)" and 
eso ts |) =p) 

18. Make induction on n. To the induction step you will need to use Stiefel’s 
relation, together with the fact that Fy4. = Fi) + Fy, for every k => 1. 


19. Let’s show that ees (-1)* wt on \= 1. To this end, start by writing 


995 


1991 (1991—k 
Le 1) 1991 — warn k )- 
tl 1991 =)... 22 1991—k 
Poe k k 
Be oe Was )+Ee vs { k 


k=0 

sal 101k) 22 1990 —k 
-_ _1)k _1yk 

=Eoo + eo 7) 


Now, forn € N, define S, = yee) by setting (2) = 0 fork > 
n—k, so that Sy = S; = 1. Use the columns’ theorem to get n= Sa = 
yeaa) = 1—S,, or, which is the same, S, = 1 — 7") Sm. From 
this last relation, prove that S,4, = S$, —S,— for every n € Z+, and compute 
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Sy = 0, $83 = Sg = —1, S5 = 0, Sp = S7 = 1. Finally, show that S,, = S, 
whenever 6 divides m — n, and observe that 


x 1k 1991 {1991—k = : 
= 1991—k k = 91991 1989 


Section 5.1 


2. First of all, note that 


(fw) (or) = Sep ts 


a b a 


(a—b) (¢ -«*) = (a—b) (—") 


1 
- sla — b)’(a@ + ab + b’). 
a 


II 


Therefore, it suffices to show that a? + ab + b* > 0. To this end, write 


; , bY. 3b 
at+ab+b={a+-=] +—>0. 
2: 4 
Of course, equality holds if and only if a = b. 
3. For x # a,b,c, to solve the given equation is the same as to solve the second 
degree equation 


(x —b)\(x-—c) + 4-a)(x—c) + @-—a)x—b) = 0. 
Now, observe that the discriminant of such an equation equals 
4(a+b+c)? —12(ab+ac + be) = 4[(a +b? +c’) — (ab +. ac + be)], 


which is a strictly positive number. 

4. Write the left hand side as a(a + b+ c) + bc. Then, apply inequality (5.2). 

5. Firstly, show that a? + b? > (a + b)ab. Then, deduce that ==>— < 
Gre getting analogous inequalities for the other two summands of the 
left hand side. Finally, add the three inequalities thus obtained. 

6. Apply Ravi’s transformation to conclude that is sufficient to prove that (y + z) 
(x + z)(x+ y) = 8xyz, for every positive real numbers x, y and z. To this end, 
use inequality (5.2) three times. 
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7. Apply Ravi’s transformation to rewrite the left hand side as 


1 z x Va x 
s(2+e4eeee tet). 
Xx xX %y yp Z Z 


Then, use the inequality between the arithmetic and geometric means for six 
numbers. 

8. By adequately grouping in pairs the summands at the left hand side and 
applying inequality (5.7), between the arithmetic and harmonic means, we get 


ere eee Se a 1 i 1 Sake) 4 
ib peg ga peg ye (a+ b)+ (c+d) 
and 
b+d d+b 1 4 

= (b+ d) | —— + — ] = (b+ d)-: ——__——_.. 
b+c d+a eer (ao taa)>! “ (b+c)+(a+d) 


Now, it suffices to add the two inequalities above. 
9. The inequality we wish to prove is equivalent to 


Dtxtor pee px! 4 x" > (Qn 1)x" 
or, which is the same, to 
( ae x7") ae Ca) He dip HE et a) > nx". 


In order to prove this last one, it suffices to apply inequality (5.1) n times, 
adding the n inequalities thus obtained. 
10. It follows from (5.4) that 


3a + bt = at +at+at+ bt > 4Vat-at-at- bt = 4|a°d| = 4a°d. 


11. We’ve already shown that a” + b? +c? > ab+ bc + ca, for every positive reals 
a, b and c. Multiply both sides of this inequality by a+ b+ c to get the desired 
inequality. 

12. Since (ab + be + ca)* = (ab)? + (bc)? + (ca)* + 2abc(a + b + c), it suffices 
to show that 


(ab)? + (bc)? + (ca)? > abc(a + b +c). 
Making x = ab, y = be and z = ca in (5.3), we get 
(ab) + (be + CaP =P HV 42 


= xy + yZ+ 2x 
=abc(at+b+c). 
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13; 
14. 


15. 


16. 
17. 


18. 


19. 


20. 


21. 
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Apply inequality (5.3) once more. 

Expand the right hand side and use the algebraic identity (2.5). After perform- 
ing the obvious cancellations, apply the inequality of Example 2.3 three times. 
Then, use inequality (5.5). 

Let S = a4(1 + b*) + BA + c+) + c*(1 + a’). Then, apply the inequality 
between the arithmetic and geometric means for two and three numbers, to get 


See. 4p 2 4 ce oe" 
3 / (2a4b?) (2b4c?) (2c4a?) 
= 6VaobecS 


= 6a°b*c’. 


IV 


Apply the inequality between the arithmetic and geometric means. 

Change a; by —a; and, then, apply the inequality between the arithmetic and 
geometric means. 

For item (a), it follows from the inequality between the arithmetic and 
geometric means that 


n= 1-dens (AEE) -() 
n 2 


For item (b), argue as in (a), applying the inequality between the arithmetic and 

geometric means to the numbers 1”, 27, ..., n? and, then, using the result of 

Example 4.18. 

For | < k < m—1, apply the inequality between the arithmetic and geometric 

means tox +k =x+1+4---+ 1. Then, multiply the m — 1 inequalities thus 
— 


k 
obtained. 


Since }°7_(S — xj) = (n— I)S, the inequality in the statement of the problem 
is equivalent to 


Then, it suffices to apply inequality (5.7). 

For item (a), just expand the product of the left hand side. For (b), apart from 
the hint given in the statement of the problem, apply the result of Problem 8, 
page 104 (with k = 2), to conclude that the sum at the right hand side of item 
(a) has exactly zn(n — 1) summands. 
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22. First of all, note that 


23. 


24. 


1 1 
kyko.. ky (| — bot + 
2 


1 
ky k ) = Prt pa t+ Dn 


Kn 


with pj = ky... kialakeaa ...k,, the hat over k; indicating that the product 


contains all of k,, ko, ..., k,, with the exception of k;. Then, we can write 
ay ay at _ pial! + pao? + +++ prair 
ky ky k,, kiky... ky, , 


Now, expand the summands of the numerator in such a way that each - appears 
p; times and apply the inequality between the arithmetic and geometric means 
to get 


1 
——_[(a¥ foetal) pet ah 4... 4 alr) > 
Kiko... ky ——— _—_—_— 


Pi vezes Pn times 


> (ar)... (ay,)!»)Pn) FR 


i Dik 
_ Ryka kn 
=> a; = 41a2...ay. 
i=1 


Make ap = 0 and apply (5.7) to get 


1 1 P 


aj — aj-1 ai — dao qj 


Then, add the above inequalities for 1 < 7 < n and group equal terms to get 
the desired inequality. Finally, conclude that equality holds if and only if the 
sequence (a,)x>1 is an AP. 

Substituting a, b and c respectively by 6x°, 6y° and 62°, show that it suffices to 
prove that 


7x12 a 12x®y® a Ty®2o aie gy!? te 9x°z6 > 
> ay 4 6x°y 4. 6x7 82? 4 12x yz? 2h 6x*y*z4 zis 6xy’z4 4 6x8 y?z". 
To this end, write the left hand side expression as a sum of seven other 
expressions, chosen in such a way that, when applying the inequality between 
the arithmetic and geometric means to each one of them, we get the seven 


summands of the right hand side. For instance: 


2x6 26 4 pharan 4 2x12 + ay + 2y'? 4 Ixy? > Lrye. 
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1. By Cauchy’ inequality, we have 12 = 3x + 4y < V3?7+4/x?+y? = 
5,/x2 + y?. Therefore, x? + y? > we. with equality holding if and only if 
z= A . However, since 3x + 4y = 12, equality holds if and only if x = 38 and 


y= 35: 
2. Adapt the discussion of the case n = 3, presented in the text. 
3. Making b; = by = --- = b, = 1 in Cauchy’s inequality, we get 


a +a, +--+ an < Yar t+azt+---+aJ/n, 


with equality if and only if there exists a nonzero real number A such that 4 
@=-+-.= & =}, i-., if and only if a), a2,..., dy are all equal. Dividing - 
n ‘both sides of the seve inequality, we get the decided inequality. 

4. Apply the inequality between the quadratic and arithmetic means (see last 
problem) to the numerator of the fraction at each summand above. Then, add 
the results thus obtained. 

5. We want to prove that 


1 
2 
we} ( Le] 2” 
: Xj 
j=l j=l 
“us . 1 
for all positive reals x), x2, ..., Xn. To this end, make a; = ./xj, bj = i and 


apply Cauchy’s inequality. 
6. Multiply both sides by a + b+ c+ d and apply Cauchy’s inequality for n = 4, 


with a) = Ja, a, = Vb, a3 = Je, ay = Vd and b} = zz, by = 
b3 = Je ba = <7. Alternatively, apply (5.7), writing c = $+5, += = ite 


—_d d d d 16 _ 1 1 1 1 
ad=4tGtgtgad = amtmtamt a, 


7. For the left hand inequality, let S = /2x+ 3+ ./2y+ 34+ /2z+ 3. Applying 
the inequality between the arithmetic and geometric means for three numbers, 
we get 


S > 3V/ (2x + 3)Qy + 3)Qz+ 3) 
= 34/8xyz + I2ay + az + yd) + 18 Fy +2) +27 
> 3V18-3 +27 = 3V9. 
For the right hand inequality, apply the inequality between the quadratic and 


arithmetic means for three numbers (cf. Problem 3), with aj = /2x + 3, ay = 
J2y + 3 and a3 = /2z + 3, and use the fact thatx + y+ z= 3. 
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8. Apply the inequality between the quadratic and arithmetic means (cf. Prob- 


lem 3), with a, = ,/ (7). Then, use the result of item (a) of Corollary 4.25. 
9. By Cauchy’s inequality, we have 


10. Since 
aR Gane ge 2 ik by . 
ag + dy atb a+b 
we get 
n 2 n n n 2 
ay by be 
= ag — by) + = 
Darth yt ath kath 
Hence, 
n 2 n 2 
a a+b: 1 
A = b 
bo thy = Blt Do tbe 5 lat K) 
and it suffices to show that x+y > aa for all positive reals x and y. This 


follows immediately from the incaualiey between the quadratic and arithmetic 
means (cf. Problem 3). 
11. Start by letting a,4; = a,. Now, for 1 < j < n, use Cauchy’s inequality to 


write 
a? 
qj+1 


a 2 
dj+1 Ll + aj 


IA 


2 
a: 
(+a)? =[1-1+ /@97-—H 
ij V4 aa 


so that 


Then, multiply the n inequalities obtained in this way to reach the desired one. 

12. Perform Ravi’s transformation (according to the discussion that precedes the 
statement of Problem 6, page 120) to show that the given inequality is 
equivalent to 
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13. 


14. 
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Vx + /2y + V2z< JV FZ t+ MEF E+ VEFY, 


for positive reals x, y and z. Then, use the inequality between the quadratic and 


arithmetic means (cf. Problem 3) to show that ,/ a > EN with equality if 


and only ity = z; analogously, get y=E = 4M ana (5H = AG, ana 
add these three inequalities to get the desired one. 


Use Cauchy’s inequality to get 


n 2 n n 
(> anc < (20007) (> 2) : 
k=l k=l k=l 


and 


Now, check that the inequality between the quadratic and arithmetic means (cf. 
Problem 3) gives 


Finally, it suffices to combine both inequalities above. 
Make x = 1, y= i, 2= 1 and, then, apply Cauchy’s inequality to get 


x2 y 22 


ytr x+2 x+y 


(+94+6+9+ +9 Je etyte? 


Finally, use the inequality between the arithmetic and geometric means to 
conclude the proof. 


Section 5.3 


1. 


Start by observing that 


a+)" =(14 l )(1- 1 ) 
(+g) Xo ntl (n+ 1?) 


Then, apply Bernoulli’s inequality. 
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2. Write a” = m"™ (1 + amym and apply Bernoulli’s inequality. Do the same with 


3. 


Nn 


a” and, then, add the results. 
The desired inequality is equivalent to 


a® b® rou 


Pe ce) Gop = ab + ae + be. 


Suppose, without loss of generality, that a < b < c. Then, a® < b® < c® and 
o < aoa < a. Applying in succession Chebyshev’s inequality and the 
inequality between the arithmetic and geometric means, we get 


a& be reg 1 1 1 1 
+ + > (a +0 +) ( + + ) 


Re ' gc ' gb — bee | ge | a2be 


1 1 1 
2222 
oo (= = ac? - =z) 


eCt+h+c’. 


IV 


II 


Finally, since we already know that a@+bh+c > ab+ac +t bd, there is 
nothing left to do. 


. Write Eat + 1G free Rises = x “xy + a Xo ferret ae + X1994 and, then, 


apply Chebyshev’s inequality to get 
1 
AE AAG tot Tong 2 ogg Gel $22 +21 + Ahoosd Gr +2 +++ + 1994): 


Now, use the equations of the system. 


. Apply Chebyshev’s inequality to each summand of the left hand side. 
. Let’s show that, for every n > 1, one has 


Y Aiaibi _ (> 2a) (> ay > 0. 
i=l i=1 i=l 
To this end, start by observing that the left hand side equals 


2 Ai(Ar bee FA He + Andaibi — Ds AA jaibj, (B.2) 
i=1 ij=1 
idi 
where ae indicates that the summand (; is omitted. Then, notice that, to each 


pair (i, 7) with i <j, the sum 


Aid aid; + jd jajb; = Ajdjaib; = Vid jajbi = —hjAj(ai = aj) (bj es bj) 
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appears in (B.2) exactly once, which means that 


> Ajaibj — S na) (> us) Vid, (ai = aj) (bi = bj) > 0. 
i=1 i=1 


i=1 


Equality holds when all of the a;’s or all of the b;’s are equal. Finally, the usual 
Chebyshev’s inequality corresponds to the particular case in which A; = 1, for 
l<i<n. 


. Apply Chebyshev’s inequality to conclude that the expression at the left hand 


side is greater than or equal to 


L@ +h 404d) a ee 
—(a Cc °) |§ —— + ——_ +. ———__ + ——_—_ ] 
4 b+c+d at+ctd at+tb+d a+b+e 


Then, apply the result of Corollary 5.20, together with (5.7), to conclude that 
the last expression above is greater than or equal to 


at+tb+c+d\? 4? _ (at+b+ce+dy 
4 3(a+b+c+d) — 12 , 


Finally, observe that the condition given in the statement of the problem is 
equivalent to (a+c)(b+d) = 1, and apply the inequality between the arithmetic 
and geometric means. 


. By symmetry we can suppose, without loss of generality, that a > b => c. This 


1 


way, we get -— > 3 > =,- Applying Chebyshev’s inequality and (5.7), we 
obtain 
a” b” c" 1 1 1 
+ + >s a" +b" +c" 
b+e a+e =3(5 ate —)( ) 


3 
n b” ny 
tanpeo 7 


Also by Chebyshev’s inequality, it follows that 
n n n 1 n—1 n—1 n—1 
a’+b"+c Bq eee +b" +c"), 


which completes the proof. 


. Suppose, with no loss of generalidade, that x > y > z. Use Chebyshev’s 


inequality to conclude that the expression in the statement of the problem is 
greater than or equal to 


(++). «+D+604+)D4+@+) 
3 + D0+ D+) 
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10. 


ine 


12; 
13. 


Apply Chebyshev’s inequality to the first factor and the inequality between the 
arithmetic and geometric means to the second factor to conclude that the last 


a5 , where t = a(x + y+ z). 
Finally, use again the inequality between the arithmetic and geometric means 
to conclude that ¢ > 1, and prove that t > s > ao > oe 

In order to get the first inequality, apply Chebyshev’s inequality to the first 
sum, followed by the result of Corollary 5.8. For the second inequality, use the 
inequality between the arithmetic and geometric means. 

Suppose, with no loss of generalidade, that aj < ay < +--+ < d,. Arrange the 
given numbers in n pairs (bj, c,), (b2, C2), -.-, (On, Cn), With b} < bp < +--+ < Dy 
and b; < c;, for each j. We wish to maximize 


expression above is greater than or equal to 


S= bic} + boc2 a DnCn. 


To this end, we will show that c; < cz <--- < cn. Indeed, if the sequence (c;) 
isn’t nondecreasing, there will exist indices i > j such that c; < c;. In this case, 
we change the positions of c; and c; in S, after which the new sum will be 


S=S- bic; — bjc; + bic; + bjci =S+ (bj = bi)(cj = Ci) >. 


Hence, the sum will be maximal when c; < co < -:: < Cy. Finally, observe 
that, for all indices i < j, we have b; < c; < c;. Suppose that, for some pair of 
indices i < j, we would have b; < c;. Change c; by b;, so that the new sum is 


Sv =s- bic; — bjcj + bjb; + cic) = S+ (bo) -— cj) (bj —c)>S. 


Therefore, we must have b; < c; < b; < cj whenever i < j. In general, we must 
have 


by Sc) S bo Sc. S+++ Sdn Seq, 
so that the maximal sum is a)a2 + a304 + +++ + Ay] A. 


Adapt, to the present case, the idea of the proof of rearrangement’s inequality. 
For item (a), it suffices to see that 


a(x + y—a) —xy = (ax—xy) + a(y—a) = —x(y— a) + aly—a) 
= (y—a)(a—x) = 0, 


for y > a => x. In what concerns (b), we want to prove that, given an integer 
n > | and positive reals x),...,X,, one always has 


AM (x1,...5%n) = GM (x1,...,%n); 
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where AM and GM denote the arithmetic and geometric means, respectively. 
To this end we can suppose, without loss of generality, that x; <--- < x,, with 
xX, < X,. Ifa = AM(x,...,Xn), then x} < a < x,. Similarly to the proof of 
item (a), we obtain a(x, + x, — a) > x1Xp, so that 

AM (x1 + Xn — 4, %2,.--,Xn—1,@) = AM (x1, %2,..-,Xn—15Xn) 
and 

GM (x1 + X% — 4, X2,.-.5Xn—1,@) > GM (x1, %2,.--,Xn—15 Xn): 
If the numbers x; + xX, — G,%2,...,Xn—1,q@ are all equal, we will have 

AM (x1 + Xn — G,%2,.--,Xn—1,@) = GM (x1 + Xp — G,%2,.-.,Xn-1,) 
and, therefore 
AM (41, %2,---,Xn—1,@) > GM (x1, X2,...5Xn—1, 4). 

Otherwise, we order those n numbers as y, < --- < y,, with yj < yy. Since 
AM (y1,...,¥n) = @, it follows that y) < a < y,. Now, changing y; per y; + 


Yn — a and y, per a we obtain, as above, 


AM (1 + Yn — 4, Y2,-++5Yn-1,4) = AM (y1, 2, ---. Yn—1 Yn) 


and 

GM (y1 + Yn — 4, Y2,-++Yn-1,4) > GM (91, 92, -- + Yn-15 Yn). 
Among the numbers y; + y,—4d, y2,..-, Yn—-1, a We now have at least two which 
are equal to a. If the numbers y; + y, — a, yo,...,Yn—1,@ are all equal, we will 
have 


AM (1 + Yn — 4, Y2,-+-5Yn—1,4) = GM (1 + Yn — A, 2,--- Yn @) 


and, hence, 


AM (91, 2, -+ +5 Yn—1s Yn) = GM (1, ¥2,---.Yn—15Yn)- 


Otherwise, we operate a third exchange of numbers, as described above. 
Observe that this algorithm ends after a finite number of steps, and when we 
reach this point all numbers in our list will be equal to a. In particular, the cor- 
responding arithmetic and geometric means of this las set of (n equal) numbers 
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will be equal. However, since each performed operation preserves arithmetic 
means and increases geometric means, it follows that, at the beginning, we had 


AM (x1,X2,....%n—1,@) > GM (x1, x2,...,Xn-1, 4). 


14. Make a proof by induction on n > 1. For the induction step, it suffices to prove 
that 
2 2 2 
a a a 
(1+ a ) 1+ > (14 2) dba) 
An+1 a a 
or, equivalently, that 
1 1 a. 
2 2 n 
a -—+a4,- = an41 + —. 
a ae 
Finally, observe that such an inequality is true by an immediate application of 
the rearrangement’s inequality. 

15. (By Prof. Emanuel Carneiro) Let’s make induction on n, leaving the initial case 
n = 3 as an exercise. More precisely, let’s show that, in order to maximize the 
left hand side expression, one of the x;’s must be equal to 0, and that this allows 
us to invoke the induction hypothesis. To this end, let 

E(x1,X0,---;Xn—1,Xn) = xX? + 15X3 + 3X4 fee p Ox. 
and suppose that the expression attains its greatest value” for some sequence 
(x1,X2,...,X») such that none of the x;’s is equal to 0. Substituting x,-; by 0 
and x; by x; + %,-1, we obtain a new expression, which is less than or equal to 
the original one, i.e., which is such that 
O = E(x, + Xp—-1,X2,.-., 0, Xn) _ E(x1,X2, see Xn—1,Xn) 
2 2:32 2 
= 2X1 X2Xp—1 + X2X__y + Xn-1Xy, — Xp—oXn—-1 — Xp Xn- 
Upon dividing this last expression by x,_1, we get 
2x 1X2 + XoXp—1 + x _ aan —Xn—-1Xyn < 0. 
Analogously, performing the exchange operations 
Xj) +> 0 and x41 > X41 + X-1, 
2Here, we're tacitly assuming that there exists a sequence (x1, x2,..., Xn) such that x) + 2x2 +++ 
Xn = land E(x, %,...,2 x,) is maximal. Although this could be rigorously proved, such a proof is 


beyond the scope of these notes, so that we shall not present it here. 
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for 1 <i<n (with xo = x, and x,4+1 = x1), we get the inequalities 
Wx. 1x: 215 X 2 _y <0 
Xi+1Xi+2 + Xi+2Xi-1 + Xx; Xi_-2 Xi-1Xi SY, 


for 1 < i <n. Adding all of these, we arrive at the inequality 


n 


Yo xirigt + xi43) <0, 


i=1 


which is an absurd. Therefore, if (x1, x2,...,%,) maximizes E(x1,%2,...,Xn), 
then at least one of the x;’s must be equal to 0. Suppose, with no loss of 


generality, that x, = 0. Then, 
Emax _ E(x, x2, wee yXn—-l, 0) 
= xpX2 ate 15X3 a aa See Mod 
S xx. +.2x5K3 + 5K4 Fe Fon FM 
4 
< aaa 
~ 27 
where, in the last inequality, we applied the induction hypothesis. 
17. Make 1; = oe for 1 <j <n, and conclude that 


byte + By = ay tees dy & ai(Ay 1) an(A2—1) +++ a(n 1) = 0. 


Then, apply Abel’s inequality to get 


Y > aj(Aj — 1) = ay min {Ay — 1,41, +.A2—2,..., Ar +20 + An — a}. 
j=l 
Finally, use the condition given in the statement of the problem, together with 
the inequality between the arithmetic and geometric means to show that A; + 
std, >k, forl <k <n. 


Section 6.1 


1. We must have x — 1 > 0 and 3 — x > 0, so that x € [1, 3). 


2. We must have x > 0,3 —./x > 0, 3- /3—/x > Oand 5-4/3 - JV3—/x. 
The maximal domain of f is the set formed by the intersection of the solution 
sets of these inequalities. 
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3. 


Nn 


11. 


For item (a), just see that f(1) = f(+) = El = 0, fo) = f (42) = 


Hoa = ands (#) =f (2) = 24! = 5. Foritem (b), if |a2—b?| = 55 
and a* + b? = 73, then (a,b) = (8,3) or (3,8), so that f (3) =f (8) = =. 
If ja? — b*| = 32 and a? + b? = 257, then 2a” = 289 or 2a” = 225 and, 
hence, a ¢ N. However, setting a” — b? = 32k and a* + b* = 257k, with 
k € N, we get 2a” = 289k and 2b* = 225k, so that k = 2 gives a = 17 and 
b = 15 (analogously, by letting b* — a* = 32k, we obtain a = 15 and b = 17). 
Therefore, f (2) =f (2) = 34. Finally, if a?—b? = 101k and a? +b? = 89k, 
for some k € N, then a? = 95k and b* = —6k, so that (ab)? = —570k?; if 
b* — a* = 101k and a* + b? = 89k, we conclude, analogously, that (ab)? = 


—570k?. In any case we reach a contradiction, for, (ab)* > 0 > —570k?. 


. Start by making x = 1 and y = V2 in the given relation, to compute f(1+ V2). 
. One has to prove that f(az41) = f(ax) + f(r), for every k > 1. To this end, let 


x = azand y = r, so that x + y = agy. 


. Adapt, to the present case, the hint given to the previous problem. 
. Inthe case of £, notice that one has to shrink the common domain X of f and g, 


to avoid those x € X for which g(x) = 0. 


. The image is the set Z of integers. 
. Start by showing that {x} € [0, 1), for every x € R. Then, show that the image 


of {-} is precisely the interval [0, 1). 

If a, denotes the n-th natural number which is not a perfect square, then there 
exist positive integers s and t, such that 1 < s < 2tanda, = +s; in particular, 
?’ is the greatest perfect square which is less than or equal to a,. Since there are 
exactly a, integers from 1 to a,, exactly t of which are perfect squares (these 
being 17, 27, ..., 17), it follows that a, is the (a, — t)-th non perfect square. 
Hence, n = a, —t = + s—t, so that 


t= 1 Gongs 
=,/n—-s+-—-4+-. 
4° 9 


Now, since s > 0, we have 


| 1 1 1 
dn =tl+saenttent RLS ES SM NETS 


On the other hand, by using the fact that s < 27, we get, as above, a, > n+ 
Jn — 5. Therefore, 


1 1 
(n+ Vit 5)-1 <a <nt vi, 


and, thus, a, = [n+ /n+ 3]. 
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For the first part of item (a), make x = y = 0; for the second, make y = —x. 
For item (b), use (a). For item (c), consider initially the case of m € N, by 
induction; then, use (a) to get the case m < 0. Finally, apply the result of (c) to 
get (d) and the results of (c) and (d) to get (e). 


Section 6.2 


Noe 


Nn 


. Show that it is equal to (0, +00). 
. Start by recalling that x + + > 2, for every real x > 0, with equality if and only 


if x = 1. Then, use this fact to conclude that the image is R \ (—2, 2). 


. Let’s analyse the case in which f increases in (—oo,a] M J and decreases in 


[a, +00) MJ (the other case is completely analogous). For x9 € J such that xp < 
a, we have f(x) > f(a), so that x9 is not a minimum point of f. Accordingly, 
for xo € IJ such that x) > a, we have f(xo) > f(a) and, as before, xo is not a 
minimum point of f. Hence, a is the only minimum point of f. 


. For yo € Y, take x9 € X such that f(xo) = yo. Then, by the definition of the 


function f + c, we have (f + c)(%)) = f(x) +c = yo +c, So that yo +c € 
Im (f + c). Since yo + c is a generic element of Y + c, it follows that Y +c C 
Im (f +c). Now, if y; € Im (f+ c), then, by definition, there exists x, € X such 
that (f-+c)(x,) = y or, which is the same, f(x))-+c = y;. Then, f(x1) = y;—c, 
and hence y; —c € Im(f) = Y. This way, y;) = (yi —c) +c € Y+c and, since 
this is valid for every y; € Im(f +c), we get Im(f +c) C Y +c. Therefore, 
Im(f+c)=Ye. 


. Adapt, to the present case, the hint given for the previous problem. 
. Let’s consider the case a > 0, the case a < 0 being totally analogous. For 


_A 
4a 


y € R, we have f(x) = y if and only if (x + by? 


5 = » or, which is the 
ca a 
same, 


b \? A 4 A 
(x+ 2) =24 oe 


2a a’ 4a 4a? 


Since (x + Ey > 0, the last equation above has a solution if and only if 4ay + 


A > 0. However, since a > 0, such a condition is equivalent to y > —4. 


Hence, there exists x € R for which f(x) = y if and only if y > -4, so that 
Im (f) = [-4, +00). 


. Use the canonical form f (see the statement of the previous problem) to get, in 


the case a > 0, 
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8. 


10. 


Ls 


12. 


xy<0¢ ee 7A <0 ee ae 0 
= Ss Dai ome = sS 
Fe al De 2a 4a 2a 4a 


? + : ? + . < é So? e <x+ z a 
—|)<— x+— — —_— —<— 
* 2a 4a 2a 2a 2a a 2a 2a 


In order to show that A > 0, use the canonical form of f to get 


, b\> A 
Hey =6 (+>) ae le 


so that af (xo) < 0 if and only if (xp + ay - a < 0. Now, since (x9 + ay > 
0, we must have 0 < (x0 + a) < a 
use the result of the previous problem. 


and, hence, A > 0. For the second part, 


. Ifa rectangle of perimeter 2p has dimensions x and y, then x + y = p. Show 


that its area depends on x according to the quadratic function f(x) = —x + px, 
and apply the result of Proposition 6.25. 

Letting / be the width and h be the height of the truck (measured in meters), 
it is immediate that the load volume of each truck is 18/mm?. Hence, we have 
to maximize the product /h, subjected to the condition that the trucks can enter 
the tunnel using the correct lanes. Since the cross section of the trucks are / x h 
rectangles, the common length d of their diagonals should satisfy d < 5m. 
However, by Pythagoras’ Theorem, we have d = //? + h? and, hence, 


Ih = IW —P <IW25-P = V25P —F. 


Setting x = P, we conclude that it suffices to maximize the quadratic function 
f(x) = 25x —x?, withO <x <5. 

We must find the smallest positive real number a such that f(x) < a, for every 
x € R; equivalently, this is the same as finding the smallest positive real number 
a such that ax? — 5x + (a + 1) > 0, for every x € R. 

For even n, say n = 2k, using the triangle inequality we get 


F(X) = (le — a] +--+ + [x — |) + (leet — x] +--+ + [ere — x1) 


k 2k 2k k 
> i@-a)+ DI @-D= DI ay) ay. 
j=l j=kt+l j=k+1 j=l 


Now, show that every x € (Q, @,+41) satisfies f(x) = paar aj — ye Qt}. 


Finally, the case of an odd n can be dealt with in an analogous way. 
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14. 


15. 


16. 


17. 
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For item (a), apply the inequality between the arithmetic and geometric means 
to the Henoominiet of the defining formula for f(x).For item (b), write f(x) = 
2+ = a and, then, proceed as in (a). Finally, for item (c), apply the inequality 
between the arithmetic and geometric means to get 


e(1l—-xy = 530 -xy 
1 Se) 
c) a [ne 3 


For item (a), write ae a ae @ +1) + = + 10; for 


item (b), write ¢ = + 14 >; for (c), write P= x +32 finally, an (d), write 
6x = 3x + 3x. 

Start by writing x* + y* = (x? + y*)? — 2(ay)* < (2 + xy)? — 2(xy)’, and look 
at this last expression as a quadratic function of z = xy. 

First of all, prove that the function f : R > R, given by f(x) = x° + 2x, is 
increasing. Then, observe that the equations of the given system can be written 
as y = f(x), z = f(y), t = f(z) and x = f(f). Finally, suppose that x > y and 
use fact that f is increasing to conclude thatx > y>z>t>x. 

For fixed m,n € N, let Ik = [(Amn+1)n, (kmn+ 1)m]. Since ((k+ I)mn+t 1)n < 
(kmn + 1)m for k > mt there exists kg € N such that 


(m—n) ” 


'o Ix = [(Komn + 1)n, +00). 
k>ko 


Since f has infinitely many strangulation points, there exist natural numbers 
k > ko and p such that p is a strangulation point of f and p € J,. Then, f((Amn + 
1)n) < f(p) < f((kmn + 1)m). However, kmn + 1 is relatively prime to m and 
n, so that f(kmn + 1) + f(n) < f(kmn + 1) + f(m) and, hence, f(n) < f(m). 


Section 6.3 


1. 


For every z € R, we have |z| > z if and only if z < 0. Hence, we want to find 
the set of x € R for which g(f(x)) < 0. Make y = f(x), and apply the result of 


Problem 7, page 161, um ee that —4 <y< 5. Now, recall that y = f(x) 


to show that —4 <x—i <3 


. Start by observing that (f o i= = f(g(x)) = 2g(x) + 7 and, hence, that 


2g(x) +7 = x* - 2x43. 


. Make g(x) = y to obtain x = “© and, then, f(y) = f(g(a)) = 2x? -—4x+1= 


2(e2)'-4(e2) +1 
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4. 


10. 


11. 


12. 


13. 
14. 


13; 


Since f o g and go g are functions from R to itself, we have fog = gog > 
(fo g)x) = (geoff) > f(g) = g(f(a)). Now, it suffices to compute 
S(g(x)) = (ac)x + (ad + b), g(f(x)) = (ac)x + (be + d) and compare the 
results. 


. For x, f(x) 4 —b, we have 


f@+a _ =e (a+ 1)x+ (ab+a) 


f@tb S44b° (+ 1x+ Gt) 


FF) = 


pete = x for every x € R, 


except for at most four values of x. Hence, except for such values of x, we have 
(a+ 1)x+ (ab+ a) = (b+ 1)x? + (a + b*)x. Finish by showing that b = —1. 


From this, conclude that we must have 


. Let’s analyse the case in which f and g are increasing, leaving the analysis of 


the other two cases to the reader. Given x; < x, in J, it follows from the fact 
that f is increasing that f(x) < f(x2). Now, since f(x,),f(%2) € J and g is also 
increasing, we have g(f(x,)) < g(f(%)). 


. Make x = < in both given relations. 
. Verify that F is injective by showing that F(x;) = F(x.) => x, = x2. Show that 


the surjectivity of F reduces to its very definition. 

Start by assuming that f = g + h, with g,h : X — R such that g is even and 
h is odd. Then, use the definitions of even and odd function to conclude that 
g(x) = 4 @) + f(—x) and h@L GF) — f(—9). 

Initially, show that f(1) = 0. Then, make a = | and b = —1 to compute f(—1). 
Finally, make b = —a. 

It suffices to compute (f 0 f)(—x) = f(f(-») = f-f@) = -fF@) = 
—(f of)(x), where, in the second and third equalities, we used the fact that f is 
odd. 

Take f(x) = —g(x) for x < 0 and f(x) = —x for x > 0. 

Start by writing 


4x? = A(x? — 1) 4 


k 
x+1 x+1 rari 


f@) = 


x+1 


4 1 
= 4(x- 1) + ——+k=4 1+——]+(k-8). 
@-1)+ sit (x+ +)+ ) 


Hence, letting ga) = x+1+4+ =: we have f(x) = 4g(x) + (k — 8). 
Now, Problem 2, together with the fact that g is an odd function, implies that 
Im (g) = R\(—2, 2), so that (by the result of Problem 5) Im (4g) = R\(—8, 8). 
Finally, Problem 4 guarantees that Im (f) = R \ (k — 16, k), and we must have 
k—16=-—Landk= L. 

For item (a), start by showing that every x ¢€ R can be written as 
x = kp+a, with k € Z anda eé [0,p). Then, make induction on k to 
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14. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
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show that f(x) = f(x — kp), for every k € Z. For item (b), first observe that 
g (x + 4) = f(ax + p) = f(ax) = g(x), for every x € R; subsequently, note 
that if p’ > 0 satisfies g(x+p’) = g(x) for every x € R, then f(ax+ap’) = f(ax) 
for every x € R. Finally, use this fact to deduce that |a|p’ = |ap’| = p. 

In order to show that f is odd, start by observing that f(20 + x) = f(10 + 
(0+ x)) = f00 — 10+ x)) = f(—+x) and, analogously, f(20 — x) = f(x). 
Then, use the fact that f(20 + x) = —f(20 — x). For the second part, compute 
f(40 + x) = f(20 + (20 + x)) = —f(20 + x) = f(x). 


Making g(x) = f(x) + 5, show that g(x + a) = , [4 — g(x)? and, hence, that 
g(x + 2a) = g(x), for every x € R. 

If X = {x € Z; x > 100}, then Im(f) = f(X) U f(Z \ X). On the one hand, 
f(X) = {y € Z; y > 90}. Now, if 90 < x < 100, then 101 <x+ 11 < 111 and 
f(x) =f + 11)) =f@ + 1). Therefore, f(90) = f(91) = --- = f(100) = 
fGCA11)) = fA01) = 91. Use strong induction to show that f(90—x) = 91 for 
every integer x > 1. For the induction step, if f(90 — x) = 91 for every x < n, 
observe that, for x = n, one has f(90 —n) = f(f(101 — n)) = f(91) = 91. 
Finally, conclude that Im(f) = {91, 92, 93,...}. 

From f(f(”)) = 4n + 1, we get f(4n + 1) = f(Ff(f(™))) = 4f(n) + 1. On the 
other hand, it’s easy to prove by induction, with the aid of items (a) and (b), that 
f(2*) = 2**! — 1. Consequently, again by item (c), we obtain f(2**! — 1) = 
f(f(2)) = 4-2* + 1. Due to these facts, and since 1993 = 4-498 + 1, 
we successively compute f(1993) = 4f(498) + 1, f(498) = 2f(249) + 1, 
f(249) = f(4- 624+ 1) = 4f(62) + 1, f(62) = 2f(31) + 1 and f(31) = 
f(®-1) =4-2441=65. 

Write N = A; UA2 UA3 U..., with Aj, Ao, ... pairwise disjoint infinite sets. 
Then, for each n € N, set f(x) = n whenever x € Ap. 

For item (a), use the assumptions on A and B to construct an injection from 
A x B to N x N. For item (b), take f : Z x N > Q such that f(m,n) = 2. The 
first part of (c) now follows from (b) and Lemma 6.43. As for the second part, 
let g : Q— N be defined by letting g(r) = min{n € N; f(n) = r}. 

For item (a), let Bj} = A, and, once we have defined B,,..., By), for some 
natural k, let By = Ag\ (UIE1A)). For (b), since B, C Ax, we conclude that B, is 
finite or countably infinite. If f, : B, — N is injective, let f : ),., Bk > NxN 
be given by f(x) = (f(x), k) if x € By and verify that f is injective. Finally, (c) 
follows from (b) as a much easier particular case, arguing by contradiction and 
taking A; = Tand A, = Q. 

Suppose, for the sake of contradiction, that F = {A,,A2,A3,...}. Define A = 
{x1,%2,3,...} C N by choosing x, ¢ Aj, then x2 ¢ Az such that x. > x; + 1, 
then x3 ¢ A3 such that x3 > x2 + 1 etc. Since x, ¢ Ax, we have A # Ax. On the 
other hand since x, < x2 < x3 <---, we conclude that A is infinite and N \ A 
contains the infinite set {x; + 1,x. + 1,x3 + 1,...}, hence is also infinite. 
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RW 


10. 


11. 


. For item (a), use Proposition 6.39. For item (b), suppose that g1, g2 : Y > X 


satisfy g;0 f = Idy and fo g; = Idy, fori = 1,2. Use Proposition 6.34 to get 
81 = 810° Idy = g1 0 (f 0 go) = (81 Of) 0 2 = Idx 0 2 = 0. 


. Use several times the result of Proposition 6.34, together with the fact that f—! o 


f = Idx, fof ! = Idy, go g7! = Idz and g-!og = Idy. 


. Apply the result of Example 6.48. 
. See the hint to the next problem. 
. For the first part, note that the equation 


ax+b 
cx+d 


= yhas, for every y # 4, the 
—dy+b 


cy—a * 


single solution x = 


. For the first part, it suffices to see that, for 0 < x; < x2, we have x. — x, > 0; 


hence, item (a) of Problem 18, page 26, gives 
xy — XT = (2 = x)Or $x fee $571) > 0, 


so that f(x;) < f(x) and f is increasing, thus injective. For the second part, 
let y € [0, +00) be given. Since we are assuming that Im(f) = [0, +00), there 
exists x € [0, +00) such that y = x”; then, by the definition of the n-th root of 


y, we getx = 7/y, so that f~!(y) = wy. 


. Setting f(x) = 2x, we have f-!(x) = 5, so that (f +f-YQ@) = = and f - 


2 
f- D@® = = These last two functions are bijections from R to itself. 


. Suppose that f is increasing (the case of a decreasing f is entirely analogous). 


Given y,; < y2 in J, take the elements x,,x. € J such that f(x;) = y, and 
Sf (x2) = yo. If xy = x2, then yy = f(%1) = f(%2) = y2, which is not the case; if 
X2 < x1, it follows from the fact that f is increasing that y. = f(x2) < f(x) = 
y1, which is not the case either. Therefore, x; < x2 or, which is the same, 
f 'O1) <f'02). Since the elements y; < y2 of J were chosen arbitrarily, this 
assures that f—! is also increasing. 


. Show that, for fixed a, b € R, the system of equations x* = a and x — f(y) = b 


admits the single solution x = ¥/a and y = f~!(</a— b). 

Start by showing that it suffices to prove that any two functions in G commute 
in respect to the operation of composition of functions. To this end, use items 
(a) and (b), together with the fact that the only function h € G of the form 
h(x) = x +a, for some a € R, ish = Idp. 

Set g = f—! and start by showing that what is needed to be proved is equivalent 
to the existence of positive integers x < y < z, in arithmetic progression and 
such that g(x) < g(y) < g(z); then, show that we can suppose that x = 1. To 
what is left to do, fix t € IR and show that one cannot have g(t) > g(2t— 1) > 
g(4t— 3) > g(8t—7) >--- > g(1). 
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1. If there existed x, < x2 in J such that f(x,) = g(x) and f(x) = g(x), we 
would have f(x1) > f(2) = g(x2) > g(xi) =f). 

2. Initially, note that x = 4 is a solution. Then, look at both sides of the given 
equation as functions from (0, +00) to R and apply the result of the former 
problem. 

3. Letting g(x) = f(x) — f(0) in the given relation, conclude that it suffices to 
consider the case in which f(0) = 0. Under this additional hypothesis, make 
y = Oto get f (3) = ao for every x € Q. Then, use the given relation to 
conclude that f(x + y) = f(x) + f(y), for all x, y € Q, so that (according to the 
first part of the solution of Example 6.56) f(x) = f(1)x, for every x € Q. 

4. Use the given relation to prove by induction that f(n) = f(1), for every n € N; 
then, consider the case n < 0 by making y = —x. 

5. Make x = y= z= Oto get f(0) = 53 then, obtain f(1) = 5 by an analogous 
reasoning. Make y = z = | to conclude that f(x) > 3 for every x € R. Finally, 
make an analogous substitution to prove that f(x) < 3 for every x € R. 

6. In each of the intervals [n,n +f(n)] and [f(n), 2f(n)] there are f(n) + 1 naturals. 
Use the increasing character of f, together with the given relation, to conclude 
that f(n + k) = f(n) + k, for every 1 < k < f(n). Now, fix natural numbers 
k and n, with n > k. Use the fact that f(n) > n to show that f(n) > k and, by 
what was done above, that f(n) =n —1+f(1). 

7. Set x = a = 0 in (a) to get f(0) = O. Then, letting x = 0 in (a), show that 
f(@ = a for every a € Z. Now taking x € [0, 1), use the fact that f(x) € Z and 
S(f(@)) = 0 to conclude that f(x) = 0. Finally, for a general x € R, change x 
by {x} (the fractional part of x) and make a = |x| in (a) to obtain f(x) = [x]. 

8. Start by computing f(x + f(y + f(0))) in two different ways to get f(0) = 0. 
Then, deduce that f(f(y)) = —y for every y € Z, so that f is bijective. From this 
last relation, compute f(f(f(x))) in two different ways to conclude that f is odd. 
Change x by f(x) in the given relation to obtain f(f(x) + f(”)) = f(Ff@ + y)) 
and, then, f(x) + f(y) = f(@ + y), for all x,y € Z. Finally, use induction to 
get f(x) = f(1)x, for every x € Z, thus arriving at the contradiction —1 = 
FC) = fA)FC) = 0. 

9. Set k = 0 to conclude that f(1) = 2; then, use induction to show that f(n) = 
n+ 1, for every n € N. Now, use the given relation to show that, if f(—1) = a, 
then a < 0 ora > 0 lead to contradictions. Finally, make another induction to 
conclude that f(—n) = —n + 1, for every n € N. 

10. First of all, show that we can suppose that f has no fixed points. Then, write 
A = BUC, where B and C are disjoint sets such that f(x) > x for every x € B 
and f(x) < x for every x € C. If B has / elements and a is the common value 
of |f(x;) — x;|, conclude that 0 = ae (f (xj) — xj) = (21— k)a and, hence, that 
a = 0, which is a contradiction. 


11. 


12. 


13. 


14. 


15. 
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Set x = y = 0 in the given relation to obtain f(0) = 2f(0)f(a) and, hence, 
f(@ = 5: Set just y = 0 to get f(x) = f(a — x) for every x € R, so that 
f(—x) = f(a + x), also for every x € R. Substitute y = a in the given relation 
to get 


1 1 
f(a) = fe + a) =f O) + f@fla=¥) = sf) + sf) =f, 


for every x € R. Now, let y = —x in the given relation to find 


1 
5 =FO =fe@fla+ x) +f(-ofla—») 
= f@Of(—») + fof) = 2f 0. 


Therefore, if we show that f(x) > 0 for every x € R, it will follow from the last 
relation above that f(x) = 5 for every x € R. To what is left to do, it suffices to 
make y = x in the given relation, to obtain 


fQx) =fOfla—x) +fOfla—x) = 20)" 


and, hence, f(x) = 2f (3) > 0, for every x € R. 

Set x = y = Oto get f(0) = 1. Then, use induction to prove that f(nx) = f(x)", 
for all x € Q andn € N; therefore, use this relation to show that f(x) = f(1)*, 
for every x € Q?.. Now, letting y = —x, show that f(—x) = f (x)! and, hence, 
f(@) = fC)’ for every x € Q. Finally, use the fact that the codomain of f is the 
set of positive rationals to conclude that f(1) = 1. 

Making x + y = a and x — y = BD, show that f(a) + f(b) = 2f (4°), for all 
a,b € [0, 1]. Setting x = y, show that f(2x) = 2f(x) for every x € [0, 1], and 
conclude that f(a) + f(b) = f(a + b) for all a, b € [0, 1]. From this point, show 
that f(x) = x for every x € [0,1] N Q To what is left to do, reason as in the 
passage from Q to R in Example 6.56. 

For item (a), start by making m = 0 in the given relation to get f(f(n)) = 
f (0)? +n; then, conclude that f is a bijection. Take k € Z for which f(k) = 0, 
and let ] = f(0). We then have / = f(0) = f(f(k)) = f(0)? +k = P +k, 
whereas, letting m = k and n = 0 in the stated relation, we get k* + 1 = k; 
hence, k = / = 0. In what concerns (b), make m = | andn = 0 in the given 
relation to obtain f(1) = 1; then, deduce that f(f(m) + 1) = n+ 1 and, hence, 
that f(n) = n for every nonnegative integer n. Finally, extend the arguments 
above to the negative integers, thus showing that f(n) = n for every n € Z. 

Let x = y = | to conclude that f(1) = 1. Then, successively show that 
fFGy)) = ryfoy) = fFCOFO)), fy) = fOOf() and FO?) = FO), 
for every x,y € N. Now, if f(x) < x? for some x € N, write f(x)? = f(x) > 
ff (x)) = xf (x)? to arrive at f(x) > x*, which is a contradiction; analogously, 
show that one cannot have f(x) > x? for some x € N, so that f(x) = x? for 
every x EN. 
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Write x — 2 in place of x e 2 in place of y in (a) to conclude that f(x) = 0 for 
x > 2. Now, let x = y = 0 in (a) to get, from (b), that f(0) = 1. Finally, for 
2 


0 < x < 2, write 2 — x in place of y in (a) to obtain f(x) = 55; on the other 


hand, show that f (« + id) = f(2)f(x) = 0 and, hence, that f(x) < — for 
each one of those values of x. 

Firstly, notice from (b) that f has at most one fixed point in each one of the 
intervals (—1,0) and (0, +00). If there exists a fixed point xo of f such that 
—1 < x9 < 0, let x = y = Xp in (a) to conclude that a + 2xo is another 
fixed point of f and, hence, that xe + 2x9 = xo, which is a contradiction. Argue 
analogously to conclude that f has no fixed points in the interval (0, +00). 
Finally, set x = y in (a) to conclude that f(x) = — 35 for every x € S. 

Start by observing that, if such anf there exist, then f(F;,) = Fx+1 for every k € 
N, where (F,)n>1 is the Fibonacci sequence. Then, use Zeckendorf’s theorem 


(cf. Problem 30, page 98) to show that one possibility for f is to have 
fly Pig sy) = Ft Fg et Pts 


for natural numbers j and | < i, < iz <-+- < ij. 

If f is such a function, set y = 0 in the given relation to get f(z) = (c + 1)z for 
every z € Im(f), where c = f(0). Then, use this fact to deduce that cf(x+y) = 
Ff) — xy for all x, y € R. In order to show that c = 0, suppose that c 0 
and conclude that 0 ¢ Im(f). Now, make x = c and y = —c in the last relation 
above to obtain f(c)f(—c) = 0 and, then, arrive at a contradiction. Therefore, 
we have got f(x)f(y) = xy, for all x, y € R. In order to finish, take yp € Im(f) 
for which f(yo) 4 0; then, f(vo) = (c + l)yvo = yo and, hence, f(x) yo = xyo, 
for every x € R. 

From (a), notice that f(x + k) = f(x) + k, for every k ¢ N. Now, take m,n €¢ N 


2 33 
and use the fact that as le € N to compute 


in two different ways, thus obtaining the equality 
(at+n’)? = a + (n° + 3n°m + 3m’), 


where a = f (“). Then, conclude that f(x) = x, for every x € Q}. 
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1. For the second part, show that the x-coordinates of the intersection points of the 
graphs are the solutions of the equation f(x) = x. In our case, the only such 
solution is x = ae 

2. If |f(x)| < M for every x € J and (xo, yo) € Gy, then |yo| = [f(xo)| < M, so that 
—M < yo < M. Hence, (xo, yo) belongs to the horizontal strip of the cartesian 
plane bounded by the parallel lines y = —M andy = M. 

3. Since the bisector of the odd quadrants of the cartesian plane is the set of points 
(x, y) for which x = y, we conclude that xo € / is a fixed point of f if and only if 
f (xo) = xo, i.e., if and only if (xo, x0) € Gy. Therefore, the fixed points of f are 
exactly the abscissas of the points where the graph of f intersects that bisector. 
In Fig. B.1, if f is increasing for x < x, and decreasing for x > x3, its fixed 
points are x;, x2 and x3, which, in turn, are the abscissas of the points A, B and 
C, respectively. 

4. Such a point (x, y) satisfies x € J and y = f(x) = g(x). Therefore, it suffices to 
solve, for x € J, the equation f(x) = g(x). 

5. Let’s prove item (a), the proof of (b) being totally analogous. If f is even, then 
f(x) = f(—*) for every x € J. Hence, for x € J, we have 


(x,y) € Gr Oy =f(x) oy =f(-x) > (-x,y) € G, 


However, since the points (x, y) and (—x, y) are symmetric with respect to the 
vertical axis, we conclude that the same is true of Gy. 

6. Functions f) and f4 are always nonnegative and vanish only at x = 0. Also, 
it is clear that they are even functions, so that (by the result of the previous 
problem) their graphs are symmetric with respect to the vertical axis of the 
cartesian system. On the other hand, as |x| increases to +00, it’s evident that 


Fig. B.1 Fixed points of a function f 
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B.2. Graphs of f and f4 


the values of f) and {4 become bigger and bigger, eventually surpassing any 
predefined value. Finally, 


jx) 1 oxt<x, fx) >1oxt> x and [x] =1 3x4 =X =1, 


justifying the fact that the graph of 4 is situated below (resp. above) the graph 
of f2 in the interval (—1, 1) (resp. outside the interval [—1, 1]). See Fig. B.2. 

Functions f3 and f5 are positive for x > 0, negative for x < O and vanish 
at x = 0. Also, since they are odd functions, their graphs are symmetric with 
respect to the origin of the cartesian plane. Now, as |x| increases, it is clear that 
the values of |f;| and [f5| become bigger and bigger, eventually surpassing any 
predefined value. Finally, 

|x] <1 > [| < |? ], [x] > 1 > [| > |?) and [x| = 1 > [2° | = |[x'], 
which justifies the fact that, in the interval (—1, 1) (resp. outside the interval 
[—1, 1]), the graph of f5 is closer to (resp. further away from) the horizontal axis 
than the graph of 3. See Fig. B.3. 


. Note that f is the inverse of the function g : R > R such that g(x) = x°, for 


every x € R. Then, apply the result of Proposition 6.63. 


. Verify that the graph of the integer part function is the union of the sets [n,n + 


1) x {n}, when n varies in Z. 


. For item (a), start by observing that, for x € [0,p), the periodicity of f 


guarantees that f(x + kp) = f(x), for every k € Z. Hence, (x,y) € GF @ y= 
f@ @&y=ftkp) > («+kp,y) € Gy. Then, letting F = G;N ((0, p) x R), 
we have Gr = U rez(F +k), where F + k denotes the translation of F, of k 
units, in the direction of the vertical axis. For item (b), note that {x} = x for 
x € [0,1). Since {-} is periodic of period 1, it follows from (a) that Fig. B.4 
sketches the graph of {-}. 
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Fig. B.3. Graphs of f3 and fs 


Fig. B.4 Graph of x b> {x} 


10. For the item (a), it suffices to show that (x, y) is on the graph of f if and only if 


11. 


(x — a, y) is on the graph of g. Indeed, if (x, y) belongs to the graph of f, then 
f(x) = y and, hence, g(x — a) = f((x — a) + a) = y, so that (x — a, y) belongs 
to the graph of g; moreover, the converse statement can be established in an 
analogous way. For item (b), it suffices to show that (x, y) is on the graph of f 
if and only if (x, y + a) is on the graph of g. Actually, if (x, y) belongs to the 
graph of f, then f(x) = y and, hence, g(x) = f(x) +a = y+a, so that (x, y+a) 
belongs to the graph of g; as in the previous case, the converse statement can 
be proved analogously. The other items can be similarly dealt with. 

At a first glance, it may seem that we cannot use the results of the previous 
problem, for, the domain of the function under scrutiny is not the set R of real 
numbers and its expression doesn’t match any of the ones considered there. 
Nevertheless, since 
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B.5 Graph of x > 


= 


2x _ ae 22 2 
x+1 x41 ~ x41? 


we can reason in a way analogous to that of the previous problem, thus 
sketching the graph of f in the following way: firstly, we sketch the graph of 
xb +; then, we translate it one unit to the left, thus obtaining the graph of 
xb = secondly, we stretch the previous graph in the vertical direction, by 


a factor 2, thus obtaining the graph of x bh ne the result along 


oi : 
the horizontal axis, we obtain the graph of x b> —: finally, if we eu 
translate the reflected graph two units above, we obtain the graph x +> =. The 
final result is shown in Fig. B.5. 

For the first part, show that, with the graph of f at our disposal, we can get the 
graph of g by reflecting, along the horizontal axis, the portion of the graph of 
f situated below such axis. For items (a) and (c), apply this pyeceaire to the 
graphs of the functions f(x) = x* — 4, f(x) = ser and f(x) = x —|x+2)+ 2, 

respectively. Also notice that, according to piablers 10, the graph of f(x) = = a 
can be obtained by translating me graph of x + one unit to the left. In what 
concerns the graph of f(x) = x? — |x + 2] 4+ 2, take a separate look at the cases 
x < —2 and x > —2, observing that |x + 2| is respectively equal to —x — 2 and 
x+2. 

Apply a (clockwise) rotation of 7 radians to the original cartesian system (this 
is equivalent to a counterclockwise rotation, of 7 radians, of the hyperbola of 


ral 
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equation x?—y? = 4). By standard analytic geometry (see, for instance, Chapter 


2 
(3) = 2/2 or, which 


2 
: : xy\> _ 
6 of [4]), we obtain the curve of equation (3) Wes 


is the same, xy = 1. 


Section 6.7 


1. For the first part of item (a), let A = (1,0) and P be the point on the 


trigonometric circle such that AP = x (of course, in the counterclockwise 
direction). Since sin x is the ordinate of P, if x,,x. € [-%, ZI, with x} < xo, 
we clearly have sinx; < sin x2, so that the restriction of the sine function to the 
interval [-4, Z| is increasing and, as such, injective. Since this restriction has 
image [—1, 1], the arcsin function is well defined and, by Problem 8, page 176, 
is also increasing. The first part of item (b) can be dealt with in a similar way, 
observing that cos x is the abscissa of P. Finally, the first part of item (c) follows 
ut 


from the fact that, for x € (-4, z), the real number tan x is the ordinate of the 


intersection point of the half-line OA with the vertical line passing through A. 
2. For item (a), observe that cotx = —tan (x + a) and use the results of 
Problema 10, page 194. For items (b) to (d), use the result of Proposition 6.63. 
3. For items (a) and (b), we refer the reader to the discussion of Example 6.68, 
more precisely to Eq. (6.15); as for item (c), we suggest the reader to recall 
items (a) and (e) of Problem 10, page 194. 
4. Argue as in Example 6.68 to get f(x) = Ja? + b* cos(Ax — a), where cosa = 
4 and sina = ——. Now, show that f ha period = 


Var +b a+b? Aq’ 

5. For item (a), start by using trigonometric identities to get f(x) = —2 sin’ x + 
2 sinx+ 1. Then, observe that the quadratic function y + —2y?+2y-+ 1 attains 
its maximum value at y = 3, increases on [-1, 5| and decreases on [5. 1]. Item 
(b) follows from the expression obtained above for f(x), together with the fact 
that the sine function has period 27. Finally, for item (c), sketch the graphs of 
x» 2sinx and x  cos(2x) on the interval [0,27] and in a single cartesian 
coordinate system; then, add corresponding ordinates of the points of these two 
graphs to get the desired sketch of the graph of f. 

6. Use the result of Problem 4, page 193. 

7. Perform the change of variable x = /5 cos 6, with 0 < 6 < az. Then, use 
the discussion of Example 6.68 to conclude that f(x) = 25 cos(@ — a), with a 
satisfying cosa = 2 and sina = 4. 

8. Suppose, by the sake of contradiction, that f is periodic, of period t > 0. Then, 
we must have f(t) = f(0) = 2. Use this equality, together with the fact that 
the cosine of every real number is at most 1, to get cost = cos(at) = 1. 
Conclude that there must exist (nonzero) integers k and / for which t = 2kx 
and at = 2/z, and arrive at a contradiction. 

9. Initially, show that the equality f(x + 32) = f(x) implies the equality 
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5 15 5 
(—1)” sin (= + =*) = sin (=) ; 
n n n 


whenever nx 4 5 + kx, for every k € Z. Then, if n is odd, for instance, show 
that the last equality implies sin (= + br) cos (+) = 0, for every x € R such 
that nx # + + kx, for every k € Z. Finally, conclude that n divides 15. 

Suppose that the given function is periodic of period p > 0. Compute f(x + p) 
with the aid of the standard trigonometric identities for the sine of the sum of 


two arcs and, then, take a careful look at the equality f(x + p) = f(x). 


Section 7.1 


1, 


OonNDN 


For item (a), observe that (an) = mE = x. For (b), argue in an analogous 
way. Item (c) now follows immediately from the results of Problems 6 and 8, 
page 176. Finally, for item (d), Problem 6, page 176, gives %/a > #/a if and 
only if a” > a’; the rest follows from Corollary 1.3. 


. For item (a) suppose, by the sake of contradiction, that a > 0. Then, by the 


Archimedian property of N, we can choose n € N such that n > 1. This is 


equivalent to a < ‘, which is an absurd. For item (b), choose n € N such that 
ra. 


a 
. For item (b), let 71,...,1% € (a,b) N Q, with rj,...,7 pairwise distinct. If 


r = min{r),...,7}, then r € (a,b) MN Q, and item (a) gives a < <r. 
Finally, argue similarly, using the second part of (a), to get infinitely many 
irrational numbers in the interval (a, b). 


atr 
2 


. For item (a), suppose that we already have the truth of the result for a > 0. If 


b < 0, then —b > 0 and, by item (a), there existe r € Q anda € R \ Q such 
that r,a@ € (—b, —a). Hence, —r,—a@ € (a,b), with -r € Qand-—a —€ R\Q. 
If a < 0 < Bb, show that it suffices to apply, to the interval (0, b), the result we 


assumed to be known. For item (b), use the Archimedian property of natural 
numbers to get n € N such that n > wa Finally, in what concerns (c), starts by 
using the Archimedian property to guarantee the existence of m € N for which 
m 
n 


> b (resp. m2 > b); continue by showing that, if m is the least possible 


natural number fullfilling this requirement, then m > 1 and ete € (a,b). 


. Adapt the proof of items (b) and (c) of the previous problem to the present case. 
. Use the result of Example 7.2, together with Theorem 7.4. 

. Use the result of item (b) of Problem 4. 

. Fora =  andb = K? +1, we have |./a—/| = 


1 1 
2 SS 
JatJb Se F1 4/2 ~ 2k 


Since every element of X is positive, it follows that inf(X) = 0. 


. Observe that, for k €¢ N, every number of each of the forms + and 1— + belong 


3 
to C. Now, use the fact that x < +, for every k EN. 
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10. Adapt, to the present case, the proof of Proposition 7.6. 

11. Use the result of Proposition 7.6, together with the result ot Problem 10, to 
establish the sar abl of x, € X and y, € Y such that a — x < X, < a and 
a<yy<at+ x: 

12. For item (a), show that, if w (resp. 6) is an upper bound for X (resp. for cX), 
then ca (resp. B) is an upper bound for cX (resp. for X). The other cases can be 
dealt with similarly. 

13. For item (a), start by showing that, if @ and 6 are upper bounds for X and 
Y, respectively, then w + 6 is an upper bound for X + Y; then, conclude that 
sup(X + Y) < supX + sup Y. Show next that, if sup(X + Y) — supX < sup Y, 
then Y would have an upper bound less than sup Y. 

15. The given conditions imply that R, has vertices (0,0), (an,0), (0, bn) and 
(an, bn), for some integers a, and b,. Passing to a subsequence, if necessary, 
we can assume without loss of generality that a,,b, > 0. Show that one can 
assume, also without loss of generality, that aj < a2 < a3 < ---. Then, show 
that one cannot have bj > bo > b3 >-::-. 

16. Since |f(x)| < 1 for every x € R, we can let L = sup{|f(x)|; x € R}. Suppose 
that sup{|g(x)|; x € R} > 1 and apply triangle inequality to the relation given 
in the statement of the problem to reach a contradiction. 

Section 7.2 

1. Show that, if a > b, then we would have a, > ae > b, for every sufficiently 
large n. 

2. Write |./a; — J/a| = en < salen — aj; then, use the convergence of 
(Gn)n=1 to a. 

3. Forn > 16a’, show that let < a < (24)" < 4. 

4. Firstly, note that 


as n — +00. Hence, fixed a real number a@ such that al <a < 1, there exists 


no € N for whichn > np > <Q, or < a”. However, since a” —> 0 


i in 


as n — +00, the same happens with + 
a > 0, and note that, for n > k, 


n_ n_ “ n j n k+1 
lal” = (1+ a)" = X (‘)e > G ie 


n(n—1)...(n—k) git! 
(k + 1)! : 


; ~. Alternatively, let Ja| = 1+ a, with 


qe 
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. For item (a), write 
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Hence, for n > k, we have 


nk (k+ 1)! nk 
la|” akt+l — n(n—1)...(n—k) 
_ (k+))! 1 


ort ad =D 2) 


with the last expression above tending to 0 as n — +00. 


. Write / = (1 — #,)! + t,/ and use the triangle inequality to get |c, — J] < 


(1 — t,)|adn — L| + thlbn — 1]. Then, show that Ja, — I|,|b, — | < € implies 
ICn —I <e. 


. First of all, note that |b, — | < max{|a, — 1, |cy — |}, for every n € N. Hence, 


given € > 0 and taking np € N such that n > np => |a, — I, |cn — 1] < €, we 
have |b, — 1| < € forn > no, so that b, =], 


nJ/n 1 I . 
Ll Vite < Wik For (b), write 


an +b a+* 
Vr +an+b-121=——~— = ——*_ 
Vn+an+b+n +4454 


For item (c), observe that, if a, = Y1+ q", then a, > 1 and, by item (a) of 
Problem 18, page 26, 


ge 
a at eo ae 1 


Finally, for (d), write Va? +b" = ay/1+ (2)" and use the result of the 


previous item. 


0<a,-1= 


. Adapt, to the present case, the reasoning that led to the proof of Proposi- 


tion 7.18. 


. For the first part, write q = 1 + a, with a > 0, and note that a, = (1+ a@)" > 


1 + na, so that a, > M forn > Mo. 


. Letb= Bi so that b > 1. Use the first part of the previous problem to get k ¢ N 


such that b§ > a. Then, show that 2” > kn for every sufficiently large n and, 


from this, conclude that a"q*" = (4)" . ooh — 0 whenn > +00. 


— + ait; then, make k — +00. 


First, show that |am,— dn| < - 
n+ 
Observe that a,4; = /1-+ an, for every n > 1. Then, successively conclude 
that a7,; — a, = a, — a, and that (a,),>1 is increasing. Next, note that 
: : l+ay I+4y, 
as = | +a, implies a,4) = a < aa < 2, and apply Bolzano- 
Weierstrass theorem to guarantee the existence of / = limy-+ +00 dn. Now, make 
n — +00 in the recurrence relation a,4,; = 1 + a, and use the result of item 
(a) of Problem 2, together with item (b) of Proposition 7.18, to get the equation 


l=VJ1+41. 
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13s 


14. 


15. 


16. 


17. 


18. 


Setting @ = min{a},a2} and B = max{aj,az}, conclude that (ay_)p>1 is 
bounded (to this end, separately consider the cases0 <a < 6B <4,4<a< 8 
and0 < a < 4 < 8). Now, if a, < @m < agora, > @ => 4a, 
write a42 — ak41 = /ax+1 — ./ax—1 to conclude that the given sequence is 
monotonic, thus convergent; make k — +o0 in the recurrence relation satisfied 
by the sequence to conclude that a, at 4.Ifa, < a3 < Mora, > a3 >, 
conclude, as above, that the subsequences of the terms of even and odd indices 


are convergent, say, @ox—1 ety c and ax, pare d; then, make k — +00 in the 
given recurrence to conclude that d = /c¢+ Jd andc = Jc + V4, so that 
c=d=4. 

Use the fact that t, = —fo —t; —---— t,-1; = 0 to write 


ag = (Vk — Vk +n) + (Vk + 1-— Vk Fn) +--- 
tthi(Wv¥ktn—-—1—-—Vvk+n) 


1o ty 
Vkt+Jk+n Vk+14+Vk4+n 
tn—-1 


~ “Jketn—1+Jkpn 


Then, make k > +00. 
Perform the trigonometric substitution x, = 2.cosy,, with y, € [0, +], and use 
some Trigonometry to conclude that y,41; => 2y,, for every n > 1. Then, show 
that y, < “ft < sex for every k > 1, so that y, = 0 for every n > 1. 

First of all, use the given condition to show that |(@n4.1—4@n) — (441-4) | < 2. 
Then, make n + +00 to conclude that there exists / = lim,-+4.0(Gn41 — Qn), 
and show that a,,, — a, = /, for every k € N. 

Use the definition of a, to conclude that a,_; + 1 = pha, for every integer 
k > 2. Then, show that 


2k + 1)2 
eee Tee) 


a1 — (ay — an41) 


e 


and conclude that the sequence (a,),>1 is decreasing for n > 2. Finally, make 


k — +00 in the equality a,_; +1 = au to conclude that a, — 1 when 
n— +0. 
Successively show that a = k+ a,_, and ae 417 ae = dy — Gn—1; then, 


conclude that (a,),>1 increases. Now, use the fact that ae < k+ ay to get 


an < (1 + /4k + 1) for every n > 1, so that the given sequence is convergent. 
Make n — +00 in a? = k + ay_ to find out a, = (1 + /4k + 1). From 
here, items (b) and (c) are relatively easy. 


586 


19. 


20. 


21. 


22. 
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First of all, use the inequality between the arithmetic and geometric means of 
two numbers to conclude that a,4,; > ./a, for every k € N. Now, make k > 
+oo in the given recurrence relation to conclude that, if (a,),>1 converges, then 
its limit equals ./a. To what is left to do, use the triangle inequality to obtain 


1 a a 
lax41 — ax] = = |x — ag-1 + — — — 
2 an Ak-1 
1 a 
= = lag — ag-1|- 1- 
2 AKAk—-1 


< =/dy — Ag-1 |, 
S glee kt 


for each k > 2. Finally, establish the convergence of (a,),>, from the result of 
Example 7.28. 

Let a), a2, ..., A, be the initial terms of the AP’s, and say that the i-th AP is 
that of initial term a; and common ratio d;. Choose n € N greater than all of 
the a;’s, and let k; be the number of terms of the i-th AP belonging to the set 
{1,2,...,m}. Then, on the one hand, k} + ky + +++ +k», = n; on the other, 
a; + (kj — 1)d; < n < a; + kjd;. Use these relations to show that 


n 


S-m Ql K) 
1 < —<l14+-, 


where S = ea “), Then, make n > +00. 
Suppose such a function does exist, let k > 1 be an integer and x; = i for 
0 <j < k. Given areal number x € [x;, xj+1], show that d(f(x), f(x)) < aii 
Letting C; denote the circle in the plane with center f(x;) and radius Preeti 
conclude that x € [xj,xj41] => f(x) € Cj. Then, use this fact to show that 
1< Eo A(C)) = ae Finally, use the fact that k > 1 can be chosen arbitrarily 
and a > 0 to reach a contradiction. 

Start by proving that a,4; > ad, + 1 and, hence, that a, > n for every n > 1. 
Now, use telescoping products to show that, for a fixed p € N, we have 


pl 1 
eas ae I] (: + 
an -0 Jf Anti 


and, then, 


p-l p-l 
n n+j 1 
p< fee <[] <T (14 ) 
an / ; 


Geis 
j=0 n+j j=0 


Continue by showing that 


B_ Hints and Solutions 587 


22. 


lim, ( a (1+ see) (1 5] =I. 


Now, if y = 1, conclude that what is asked to be proved follows from the limit 
above. If y < 1, conclude from the limit above that we can choose n, € N such 


thatn > n, > 7s > y. Use the fact that a, > k for every k to show that we 
n+p 


can choose a natural number no such that 


1 
T+ Jam tt <y Xx. Then, note that 


n > no and p natural furnish 
an 


an 
Antp  Antp + ./Ant+p 


; establish the existence of a natural number 


<y-x. 


Fix k > no, M, 
Po > p for which 

ak ak 
z=y> : 
Ak+po Ak+po+l 


Finally, suppose that < x and arrive at a contradiction. 


ot a 
Let co, C1, C2,... be defined by co = 1 and 


Rewriting such a relation as Cy = Gy—1Cn—1 — AnCn, We get the telescoping sum 

Cy ten tee + Cy = AQ — AnCn. (B.3) 
On the other hand, the assertion of the problem is equivalent to —— < 27 Un for 
infinitely many values of n € N. Arguing by contradiction, suppose that there 
exists a natural N such that the opposite inequality is true for every n > N. 
Then, for n > N we have 


1 1 1 
on & oy Dw mbt) = CHS 


where C = cy -2(!+2+~+7) ig a positive constant. If 2! <n < 2, then 


ik deat + eee ae fee Lees ! 
at ~ 2. <3 4 7 oe) 2k— 1 
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so that c, > C-27* for 24! <n < 2. Let r € N be such that 2’"! < N < 2’, 
and let m > r. Then, 
Cor Corp Pett Comey = 
= (Cor bees + Corti_y) + (Corti ++ + Cprt2_1) 
fesse (Com-1 fee Com_1) 


SOu 2 S grt, QAtae ees 4 pal ma 
_ C-@=r) 
— 5) ; 


thus showing that the sum of the c,’s can be taken arbitrarily large. How- 
ever, (B.3) guarantees that this sum cannot exceed dy. This contradiction 
finishes the proof. 


Section 7.3 


2. For (a), note that there are more numbers of the form {ja} than intervals of the 
form [ =). For (b), take 1 < s < t < p+ 1 such that {sa}, {ta} € [E =) 
write {sw} = sa—|sa]|,{tw} = te—|ta| and observe that 0 < |{sa}—{ta}| < a 

3. Use Corollary 7.33 to inductively construct a sequence (ax),>1 such that a, = 
my + nga, for integers m, and n,; satisfying item (a), and such that |a, — l| < i 
for every k > 1. 


4. For item (c), choose A and B in z such thatO ¢ AB, a € R \ Q and make 


X = {(m + nyar)OA + (m2 + noo)OB: m,,™M2,N1,Nz € Z}. 


For item iii., conclude that there exist distinct lines r and s through O, such that 
XNr,X 1s 4 O and X M1 ris not dense in r. Show that one can reconstruct X 
from X M rand XM s, apply Kronecker’s lemma and make d = s. 

5. For item (b), write n = 2 + z and take integer and fractionary parts. For (c), 
argue by contradiction, using the fact that w and f are irrational to show that one 


cannot have either 2} =l|- + or {3} =l|- ze 


Section 7.4 
aie 
1. Use que fact that " 7" =— ==] — for every n € N. 
2. Use the result of Problem 5, page 84, to show that )7~, —— = ©, for 


k=] aan adn” 
every integer n > 1. Then, make n + +00, noting that a, = a; + (n— I)r. 
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3. 


10. 


. Note that Vk + Vi? —1 < V2k, so that 2 a a 


Make S, = )-j=1 a 


1 1 1 2n— 1 
(a—1)S, = aS, —Syp =a+2{- ee _ 
a a2 qn} qn 
2a 1 1 2n—1 
=at a 7 
a-—-l\a a a" 


Now, use the results of Example 7.12 and of Problem 4, page 218, to conclude 
n 
that S, — 4 + Gp. =a 
> Dei Ge. Since 
aS V2 
a 25 ! for n > 1, use the comparison test i show that the given series 


diverges. 


. Show that n? — 1000n? > (n — 500)? for every sufficiently large natural n. 


Then, use the comparison test, together with the convergence of the series 


1 
e500 (k= 500)3/2 . 
1 


. Let r > O be the common ratio of the AP. For item (a), we have — = 


ak 
atic > ae provided k > = + 1. For item (b), we have 


1 1 1 1 


— = XX XK — . 
ae apt (2k—-1)r (2k -1)r 7 2k r 


Now, apply the comparison test. 


. Start by showing that 


for some natural number NV. Then, make N — +oo and use the formula for the 
sum of a geometric series. 


. Use the comparison test to show that the series yi >I — _ converges. Now, letting 


x denote its sum, note that, for k > n, we have 


For the first part, suppose that, for some x € (0, 1), we have x = 0.ajaa3... = 
0.b;b2b3..., with a, ~ 0 for infinitely many values of n and, accordingly, 
b, # 0 for sume: many values of n. Then,  < Doysi tk = DLs ae < 
a 1+ oo i at so that a; < b;. Analogously, show that a; > b,, so that 
ae = b,. Then, argue similarly to establish, by induction on n, that a, = by, 


for every n > 1. In what concerns the second part, define f(x) = (y, z), with 


590 


Lh 


12. 


13. 


14. 


15. 
16. 


17. 


18. 


19. 
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y = 0.aja3a5... and z = O.aza4ao... if x = 0.aja2a3..., with a, # 0 for 
infinitely many values of n. Then, use the result of the first part and the former 
problem to show that f is well defined and is a surjection. 

Use the fact that az + eat > 2a , for every k € N; then, apply the result of 
Proposition 7.39, together with the comparison test and Proposition 7.48. 

Use the fact that /apayay < 5 (ay + a,x+41) for every k € N, together with the 
comparison test. 

Imposing that F; > a, for every j < k+ 1, deduce that Fy42 > ak? ifa+1> 
a”. Use this a priori estimate to show that F, => a” for every n => 3, where 
a = min{/2, 73, L/S}. Finally, note that a > 1 and apply the comparison 
test. 

For item (a) (the proof of item (b) is analogous), let g = HA € (0, 1) and take 
no € N such that Yan < qforn > no or, which is the same, an < q" para 
n > no. Then, adapt the reasoning used at the proof of the ratio test. 

Apply Theorem 7.53. 


Start by eres that the series )> ie: CI ig absolutely convergent. Then, let 
a= YS (“)" and use the fact that e = ge q together with Theorem 7.53, 
to getae = iL 


For item (a), let 5, = p=) ae and th = )-4=1 Ay, 80 that 5) < 52 < 53 ee 
s, where s = sup{s,; n > 1}, and analogously for (t,),>1. Given n € N, prove 
that there exists m € N such that s, < ¢,,; then, make n — +00 to conclude 
that s < t. Change the roles of the two sequences to get the reverse inequality. 
For item (b), note that |a,| = ay + a,, so that at = $ (lan + An) anda, = 
$ (lanl — a). For (c), apply the result of (b) to both 7, , aj and wei ae 
By the definition of convergent series, it suffices to show that the sequence 
(Xn)a>1, given forn > 1 by x, = ayb) + anbz +--+ + anbn, is convergent. As 
we know, this is the same as showing that it is a Cauchy sequence. So, let m 
and n be natural numbers, with m > n. Let M > 0 be such that |s;| < M for 
every k > 1. It follows from Abel’s identity (5.19) (with the roles of a; and b; 
exchanged) and from triangle inequality that 


m—1 


[Xm a Xn! = | Y > si(bibin 1) - SmDm —_ SnDn 


m—| 


s es |si|(Didi+1) + |Smlbm + |Snlbn 


i=n 


= M(by, = bm) + Mb, + Mb, 
= 2Mb, —> 0. 


Hence, (X,)n>1 is indeed a Cauchy sequence. 
In the notations of the former problem, change a; by (—1)* and by by ag. 
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20. 


21. 


22. 


For item (a), observe that 2 sin(a + jh) sin 4 = cos(a+ (j — 1)h) —cos(a + jh). 
Now, sum from j = 0 to j = k and transform the result in a product. For item 
(b), note that it suffices to show that the sequence (s,)n>1, given forn > 1 by 
Sn = ae sink, is bounded. To this end, apply the result of item (a) to show 


ie fe nod 
that s, = —2—— and, hence, that |s,| < —. 
sin 4 sin 4 
2 sin 5 


Letting a, = AnOAn-+1, we have sina, = a Now, use the unit circle to show 
that sina < @ for every a € (0, Z). Finally, use the divergence of >> kel 74 to 
show that }°,., a; also diverges. 

Letting a, = 2B,, it suffices to show that 8; + B2 + 63 +--- = 4. Equivalently, 
letting b, = tan(B; +---+ B,) forn > 1, it suffices to show that que b, si, 
2(n+1)*b, +1 
QG+le—h,’ for everyn € N. 
for every n > 1. 


Use some Trigonometry to show that b,4, = 


n 


Then, use induction to prove that b, = ar 


Section 8.1 


~ 


. For the second part of item (a) note that, if a < 0 < b, then the image of the 


interval [a, b] by f is the union (—oo, +) U {0} U (;. +00). For item (b), start 
by computing the image of the interval [-3; 0] by f. 

Suppose that g is continuous, and let g(x) = c. For x € I \ {xo}, it follows 
from triangle inequality that |c—f(xo)| < |c—g(x)| + |g) —f(%o)| = lg@o) — 
g(x)| + |f(x) —f(x0)|. Now, use the fact that f and g are continuous to conclude 
that |c —f(xo)| < €, for every € > 0. 


. For items (a) and (b), use the result of the previous problem. For items (c) and 


(d), use the definition of continuity to conclude that there exists no value of c 
that turns the given function into a continuous one — it may help to sketch the 
graph of the function. 


. Start by observing that the set (x) —6, x9 + 6) x (f(%o) —€,f (xo) + €) is an open 


(i.e., with its boundary removed) rectangle centered at Po(xo,f(xo)) and with 
sides parallel to the axis. Then, recall that every disk centered at Pp contains an 
open rectangle centered at Py and with sides parallel to the axis, and vice-versa. 


. Note that, in D, the tangent function is the quotient of the continuous functions 


sin and cos. 

Use the chain rule for continuous functions. 

Use the chain rule for continuous functions. 

For item (a) adapt, to the present case, the proof of the continuity of the square 
root function at x) = 0. For (b), make y = 4/x, yo = 2/%o and apply item (a) of 
Problem 18, page 26, to get 


ly” — yol 


D0 SE yy IT 
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12. 


13. 


14. 
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Finally, for item (c), use the result of (b) to adapt, to the present case, the proof 
of the continuity of the square root function at xo. 


. Use the result of the previous problem, the chain rule and Example 8.10. 
. Use the chain rule to show that f is continuous at every xo ~ 0. Then, use the 


definition of continuity, together with the fact that |x sin +| < |x|, to show that f 
is continuous at 0. 

Fixed xo € [0, 1], take e = 5 and assume, by contradiction, that f is continuous 
at xo. By the definition of continuity, there exists 6 > O such that x € [0, 1] 
and |x — xo| < 6 imply [f(x) —f(@o)| < 5. Now, separately consider the cases 
xo € Qand x ¢ Q, and use the result of Problem 4, page 206, to arrive at a 
contradiction. 

In order to show that f is discontinuous at every rational number of the 
interval [0, 1], reason as in the hint to the previous problem, using the result 
of Problem 4, page 206. If xo € [0, 1] is an irrational number and ng € N, show 
that there exists 6 > 0 such that every fractional representation of a rational 
number of the interval (xo — 6, x9 + 6) has denominator greater than no; then, 
conclude that |f(x) —f(xo)| < =, for x € [0,1] N (x — 6, x0 + 4). 

The first part follows from the fact that f(x) is an upper bound for the set on the 
left hand side, whereas is a lower bound for the set on the right hand side. For 
the second part, use the definition of continuity, together with Proposition 7.8. 

That f is well defined follows from Problem 17, page 243, together with the 
fact that the geometric series >)”. x converges. For item (b), show that f(x) — 
fm) = a if x > X, so that each x,, is a point of discontinuity of f. Now, let 
be given x ¢ D and € > 0. Take mo € N such that a < € whenever m > mo; 
then, there exists 6 > 0 such that {x1,..., Xmg}M(x—56,x) = @. In other words, 
Xm € (x — 6, x) => m > mo, and this gives that, for y € (x — 6, x), 


1 1 1 1 
fa-fo= Loss LU gs L wm 


YSXn<x X58 <Xp_) <x m>mo+1 


Analogously, show that f(y) —f(x) < € ify € (x,x+6) and 6 > O is sufficiently 
small. 


Section 8.2 


1. 


First of all, recall that Problem 12, page 152, gives f(x) = f(1)x for every 
x € Q. Then, fix an irrational number xp € R and use the result of Problem 4, 
page 206, to find a sequence (a,),>1 in Q satisfying a, s&s Fp. Finally, apply 
Theorem 8.16. 
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. Start by using the second hypothesis to show that f is injective; to this end, take 


distinct x, y € R and consider the divergent sequence (a,,),>1 such that ao, = x 
and az,.—; = y, for every k > 1. Finally, use the hypotheses on f to show that 
f —! transforms convergent sequences into convergent sequences. 


. Given € > 0 and letting c > 0 be the Lipschitz constant of f, show that we can 


take 6 = < in the definition of uniform continuity. 


. Note that, for every 6 > 0, there exist x,y € R such that |x — y| < 6 but 


lf (x) —f(y)| = 1; for instance, taking x > 0 andy = x+ 5 it follows from the 
binomial formula that |f(x) — f(y)| = («+ sy —x">n- s a my, and it 
suffices to take a sufficiently large x. 


. If f was uniformly continuous, to the given € > 0 there would correspond 


some 6 > 0 as in (8.4). Now, since |a, — b,| ay 0, we could choose n € N 
satisfying |a, — b,| < 5; hence, we should have [f(a,) — f(bn)| < €, which is a 
contradiction. 


. Use the result of the previous problem. 


. For item (c), consider g : [a,b] > R given by g(x) = WG’ for every x € 
[a, b]. 
. It suffices to let f(x) = ———,, for x € (a,b). 


(x—a)(x—b)’ 


. Initially, observe that the problem of computing the distance from P to Gy is 


equivalent to the problem of minimizing the continuous function g : (a,b) > 
R, given by g(x) = JV (x— x0)? + (f(x) — yo)”. Now, let O(xo, f(xo)) and, for 
0 <6 < b—a, let Rs = (a+ 6, f(a+ 5)) and Ss = (b —6,f(b — 5)). Use 
the stated condition to show that there exists 59 > 0 such that, for 0 < 6 < do, 
we have RsP, SsP > PO. Then, conclude that it suffices to minimize g in the 
interval [a + do, b — do]. Finally, apply Weierstrass theorem to finish the proof. 
For item (a), apply the triangle inequality to get 


x)—a An aye a 
PON gs EN ache ON a NN as ja 
* x = y ier 
For (b), note that |x| > 1 implies |x| < |x|? < --- < |x|""!. Item (c) follows 


from (b), taking A = max ie + pe ail} and observing that x” > 0 for 
x # 0, since n is even. Finally, for item (d), use the result of item (c), together 
with Theorem 8.26 and ap = f (0). 

Item (a) follows immediately from (8.8). In what concerns (c), on the one hand 


we have 
f(a) LS im Xn) , we Fn) , we Xnt1 = a 


on the other, if 6 € R is such that f(B) = 6, show that the inequalities |a—f| = 
f(a) —f(B)| < cla — B|, together with the fact that 0 < c < 1, givea = B. 
Setting f(x) = 5 (x + »/x? + 1), we clearly have f(x) < x for every real x. 
Moreover, 
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14+ —_-___. 
V¥e+1+ fy4+1 
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2 VP +14+/yr+1 
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Use the sign-preserving lemma, together with Problem 5, page 206, and 
Theorem 8.16. 

Show that the stated condition gives f(0) = f(m + nq), for all m,n € Z. Then, 
use Corollary 7.32, together with Theorem 8.16, to prove that f is constant. 


Section 8.3 


. We could only apply the IVT if the domain of f contained the interval [0, 2], 


which is not the case. 


. Apply the IVT in each case. Sketching the graphs of the involved functions may 


help. 


. Let y = ax + b be a non vertical line, so that a # 0. It suffices to show that 


the function g : R — R, given by g(x) = x sinx — ax — b, has infinitely 
many zeros. To what is left to do, suppose that a > 0 (the case a < 0 can 
be handled similarly); show that, for every sufficiently large k ¢ N, we have 
g(2kr) < 0 < g(2km + 5). 


. Look at the function f : [0, 1] — [0, 1] such that f(x) = x+(1—x)? sin x(n—2)x. 
. Adapt the argument in the proof of Bolzano’s theorem. More precisely, suppose 


I = [a, b] (the other cases can be dealt with analogously), fix a € X and let 
A = {x € [a,b]; [a,x] C X}, so that A # 9. If c = supA, take a sequence 
(@n)n>1 in A such that a, — c and use condition (ii) to show that c € A. If 
c < b, use condition (i) to choose 0 < 6 < b—c (corresponding to x) = c) to 
conclude that c + 4 € A, which is a contradiction. Finally, make an analogous 


2 
reasoning to show that [a, xo] C X. 


. Since x = 0 is not a solution, we can write the given equation as f(x) = 0, 


where f : (0,-+00) > R is the function given by f(x) = )7_, Vz fe re, 
Thus, we need to show that f has exactly one positive zero. To this end, since the 


ai 


function x b> ,/ + + “= is continuous and decreasing, and since f is a finite sum 


x 
of functions of this kind, we conclude that f is continuous and decreasing too, 
which forces it to have at most one positive zero. On the other hand, f(1) = 
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ye V1 +a —1n > 0; also, Vz +4 <1lifx> max{ J/2, 2a;}, so that 
f(x) < Oif x > max{/2, 2a),...,2a,}. Now, apply the IVT. 


. Let f : [0,1] > R be given by f(x) = 4 ¥“_, |x — xj]. It suffices to guarantee 


n 


the existence of x € [0, 1] for which f(x) = 5. To this end, note that 
(O22) Bee 
== xj an ape Meats! Xi, 
. a= a 


so that f(0) + f(1) = 1. If f(0) = fd) = 5, there is nothing to do. Otherwise, 


we can suppose, without loss of generality, that f(0) < 4 < f(1). Since f is 


2 
continuous, the IVT finishes the task. 


. Firstly, suppose that f(0) = 0. Making x = 0 in the given relation, we get 


fQ) = fO)F(2) + fC) = 0. Now making x = 1, we get f(2) = f(I)f(3) + 
f(2) = 0. This way, an easy induction gives f(n) = 0 for every positive integer 
n. Suppose, then, that f(0) > 0. It follows from f(0)f(2) + f(1) = 0 that f(1) 
and f(2) must have opposite signs. But then, the IVT guarantees the existence 
of areal number a é€ (1, 2) for which f(a) = 0. Arguing as in the case f(0) = 0, 
conclude that f(a + n) = 0 for every positive integer n. 


. Make x = 1000 and, then, use the IVT to conclude that the image of f contains 


the interval [a0 999]. Use the IVT once more to guarantee the existence of 
xo € R such that f(xo) = 500. Finally, make x = xo in the given relation. 

First of all, show that ifn € N, thena = + is one possible value. Then, if 
a# 1 for every n € N, construct a continuous function f : [0, 1] > R such that 
f(x +a) #f(@ for every x € [0,1 — a]. 

Use the given relation to arrive at a contradiction to the IVT. 

For item (a), the fact that f is a bijection follows from Example 6.40; then, 
Theorem 8.35 guarantees that f is increasing or decreasing. For item (b), note 
first that, from (a), we have f(0) = Oe f(1) = 1. If there exists x € (0, 1) such 
that f(x) < x, then the fact that f is increasing, together with the hypothesis of 
the problem, gives x = f(f(x)) < f(x), which is a contradiction. Analogously, 
we cannot have f(x) > x, so that f(x) = x is the only possibility. Finally, for (c), 
note first that, by item (a), we have f(0) = 1 and f(1) = 0. Then, fix a € (0, 1) 
and define f letting f(a) = a and the restrictions of f to both intervals [0, a] and 
[a, 1] being affine functions. It’s immediate to check that f(f(x)) = x for every 
x € [0, 1]. 

Example 8.31 shows that g has a fixed point xo. If x, = g(%»—1) forn => 1, show 
that the sequence (x,,),>1 is nondecreasing and such that f(x,) = x,, for every 


n> 1. Letting x, —"+ a, use Theorem 8.16 to show that f(a) = g(a) =a. 

Consider the auxiliary function g : R > R given by g(x) = ae f(x +Jr), 
where r > 0 is to be chosen. Then, if f(xo) < 0 < f(x,), apply the sign- 
preserving lemma to g to show that r can be chosen in such a way that g(x9) < 


0 < g(x,). Finally, invoke the IVT. 
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Start by observing that, for x € (0,1), the function g(x) = x(f(x) — f(0)) 
measures the area of the closed rectangle having (0,f(0)) and (x,f(x)) as 
endpoints of a diagonal and sides parallel to the coordinate axis. Hence, there 
exists 0 < x; < 1 such that this rectangle has area +. Arguing in an analogous 
way, we find rectangles Rj, ..., Rx, each of which having area + and sides 
parallel to the coordinate axis, such that R; U...U Rx contains the graph of f 
and is contained in the closed square [0, 1] x [0, 1] of the cartesian plane. Finally, 
use Cauchy-Schwarz inequality to show that k < n. 

Suppose, by the sake of contradiction, that there exist x9, yo € R for which 
ft (xo) < f(y0); moreover, suppose that yo € (xo,x9 + f(xo)) (the other cases 
can be treated analogously). Use the fact that f(x» + f(xo)) = f(xo) to show 
that it is possible to choose n € N and c € R such that the line r of equation 
x + ny = c leaves the point (yo, f(yo)) in a semiplane opposite to that of the 
points (xo, f(xo)) and (xo +f (xo), f (40 tf (x0) = (x0, (X0)). Now, show the IVT 
guarantees the existence of real numbers a € (Xo, yo) and b € (yo, xo + f(xo)) 
such that the points A(a,f(a)) and B(b,f(b)) are on the line r, i.e., such that 
a+nf(a) = candb+nf(b) = c. Finally, conclude that f(c) = f(a+nf(a)) = 
f(a) and f(c) = f(b + nf(b)) = f(b); hence, f(a) = f(b) and, then, a = b. 
Making x = y = 0 in the given relation, we get f(0) + g(0) = h(0). Thus, 
letting fi(x) = f(x) — f(), g1(x) = ge) — gO) and h(x) = h(x) — h(O), we 
have 


A@ty) + gi0? +y) = Gy), 


with f,(0) = g1(0) = 41(0) = 0. Hence, we can start by supposing that f(0) = 
2(0) = h(O) = 0. Making y = —x°? in the given relation, we get g(x — x°) = 
h(—x*) for every real x. Making x = —y?, we get f(—y? + y) = h(—y*) for 
every y € R; therefore, it follows that f(x — x°) = g(x — x°). However, since 
the image of the polynomial function x +> x — x? is the whole set of reals (see 
Example 8.28), we have f(x) = g(x) for every x € R. Then, the relation at the 
statement of the problem reduces to 


f(xty’) +0? + y) = hQy). (B.4) 


Performing the substitutions y = —x° and x = —y° in (B.4) and taking into 
account that f(0) = 0, we obtain, respectively, 


f@—x) =h(-x*) and f(-y? +y) = hy’), 


so that f(x — x°) = f(—x°® + x). Using once more the fact that the image of 
the polynomial function x > x — x? is R, it follows that f is an even function, 
ie., that f(x) = f(—x) for every x € R. Now, letting y = 0 in (B.4), we get 
f(x) +f?) = 0, so that f(x) = —f(x*). Back to (B.4), this relation gives us 
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fee +y) —f( + y°)?) = hy). 


However, the fact that f is even implies, then, 


f@ +y)-f(-@+y°)) = Gy). 


Therefore, if a € R is such that the system of equations 


has at least one real solution, then h(a) = 0. Let us prove that, for every a < 0, 
this is indeed the case: if a = 0, we already have h(a) = 0. If a ¥ 0, the the 
second equation of the system is equivalent to y? + 3y°x + 3x°y? + y+ 2x7 = 0. 
Writing x = ~, we conclude that it suffices to guarantee the existence of a real 


a’ 


number y such that 
y!? + 3ay® + (3a? + 1)y* + 2a? = 0. 


To this end, let p(y) = y!? + 3ay® + (3a? + 1)y* + 2a3. If a < 0, then p(0) = 
2a? < 0 and p(—a) > 0, so that the IVT guarantees that p has at least one 
real root. To finish, not that f(x — x?) = h(—x*), so that f(x — x°) = 0 for 
every x € R. Invoking once more the surjectivity of f, it follows that f = 0. 
Therefore, (B.4) gives h = 0. Then, the functions f, g and h that satisfy the 
stated conditions are the constant functions f = f(0), g = g(0) andh = A(0), 
such that f(0) + g(0) = h(0). 


Section 9.2 


1. The idea is to adapt, to the present case, the proof of the uniqueness of limits of 
sequences, given in Proposition 7.14. More precisely, suppose that L and M are 
distinct reals and that f(x) converges simultaneously to L and M when x —> xo. 
Take € = 5|L—M| > Oand 6 > 0 such that x € J and 0 < |x — xo| < 6 give 
If (x) — L| < € and |f(x) — M| < e€. Then, use the triangle inequality to arrive at 
a contradiction. 

2. Suppose we had L > M, and let € = ou > 0. Take 6 > O such that x € J 
and 0 < |x — xo| < 6 give |f(x) — L| < € and |g(x) — M| < e. Use the triangle 
inequality to arrive at a contradiction. 

3. Make induction on n. The initial case n = 2 is exactly the content of items (a) 
and (b) of Proposition 9.7. 

4. For right-handed limits, for instance, copy the statement and proof of Proposi- 


tion 9.7, changing, whenever convenient, lim by lim and0 < |x—x9| < 6 
x>X0 X>xog+ 


(resp. 0 < |x — X9| < 61, 62) by x9 < x < xo + 6 (resp. x9 + 51, x9 + 62). 
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Implication => and the last part follow from the fact that each one of x9 — 6 < 

x < Xo and xy <x <x +60 < |x—x9| < 5 do imply 0 < |x—x9| < 6. 

Conversely, suppose Dm, f(x) = lim f(x) = L, and let € > 0 be given. 
xX—>x0 x >xX9Q—- 

Take 6,6. > 0 such that each of xg — 6; < x < x9 and xp < x < x9 + 6 imply 

f(x) — L| < €. Letting 6 = min{d,,5)} > 0, conclude that 0 < |x —x9| < 6 

implies |f(x) — L| < . 


. Suppose that lim f(x) = L > 0 and lim g(x) = +oo (the other cases are 
X—>x0 X—>X0 


completely analogous), and let M > 0 be given. By the sign-preserving lemma, 


there exists 6, > 0 such that x € 7,0 < |x—x0| < 6; > f(@) > f: by the 
definition of infinite limit, there exists 6. > 0 such that x € I, 0 < |x—x| < 
5) => g(x) > mM Then, let 6 = min{6,,6,} and conclude that for x € J 


satisfying 0 < |x —x9| < 6 we have f(x)g(x) > M. For what is left, start by 

observing that the analogous result for limits at infinity (at -+oo, for instance) 

saysthatif lim f(x) =L>Oand lim g(x) =+00,then lim f(x)g(x) = 
xX—>+00 xX—>+00 x—>+00 


too. For the proof of this fact, copy the former one, changing 0 < |x—x9| <6 
by x > A whenever convenient. 


. Suppose first that a,b,L,M € R (the cases in which a = —oo, b = +00, 


L = —oo or M = +oo are totally analogous). Given € > 0, let 6 > 0 be such 
thata <x <a+6andb—6 <y < b respectively imply f(x) < L+ € and 
g(x) > M—e. Then, apply the IVT to conclude that [L + €«,M —«] C Im/(f). 


. In all items one has to use the fundamental trigonometric limit, together 


with some algebra. For items (a) and (b), multiply both the numerator and 


« z : +. sin(2) 2 _ sin(2. a 
denominator by 1 + cos x; for item (c), write ate = 3. ane . 5 for (d), 


multiply both the numerator and denominator by oe for (e), use (d) and the fact 
3 


I~y 
I+y+y? 


that cos (4x) = sin (= - x); finally, for (f), apply the identity 1 —y = 


with y = ~/cosx and, then, the result of item (b). 


. For the first part adapt, to the present case, the proof of Corollary 9.9. In order 


to compute the limits, use the result of the first part, together with the fact that 
| sinx|, | cosx| < 1 forallx eR. 

Given € > 0, let 6 > O be such that x € J and 0 < |x —xo| < 6 imply 
If(x) — L| < €. Take no € N such that n > no => |ay — x0| < 6. Then, for 
n > no we have 0 < |a, — x0| < 6 (since a, # xo for every n > 1), so that 
\f (an) — L| < €. For the second part, compute f(a,,) and f(b,), where a, = 
and b, = aa forn> 1. 

For the first case, if lim,—++90(f(x) — (ax + b)) = 0, then 


0= lim (eo) - lim (-.-*)- im ey 


xX—>+00 Xx x—>+00 x x x>+00 X 


at. 
2nx 


and 


O= lim (f(@) — (ax+ 5) = lim (f(a) — ax) —b. 
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The second case (i.e., that of f : (-oo, A) — R) is entirely analogous the the 
first one, so will be left to the reader. 

In item (a), if y = ax + B is an oblique asymptote when x — +00, then 
a= Him,+-+00 MO = lim, 400 (I + 4) = Land B = lim, 400(f(x)—ax) = 
lin 233.4 ; = 0. Analogously, y = x is an oblique asymptote when x > —oo. 
In item (b), if y = ax + 6 is an oblique asymptote when x — +00, then 
f@) 


7 2 
a = limy++400 = lim,-+ +00 2 1-5 = and 


6B = lim (@)-ax) = lim g (ve —a —z) 
x—>+00 x>+00 a 


Te aa 0 
=— lm —- {———] =0. 
x>+o00 q /x2 —~q2+x 


Analogously, y = —2x is an pone asymptote when x — —oo. For item (c), 
observe that ae x sin + = = 1. Finally, in all of these items, we leave to 
the reader the analysis of the ¢ existence of horizontal or vertical asymptotes. 
Prior to making x —> +00, write successively 


- 1+ 
Be ee a 
xt fat Jat Vx I+ /etztl 


We can suppose n > 2, so that AO) ~ £29 On the other hand, if k € N is such 


I ~ Ff" 
k frx) fk) _ pyk-l f@/thy 
that 2° > n, then Fo < GG = jai fap: Hence, 


fx) _ fx) Ff!) 
FO ~ FO "jy FO” 


and it suffices to make x —> +00 in the above inequalities and invoke the 
squeezing principle to conclude the proof. 

For the second part of item (a), use induction. For item (b) and for the first 
part of (c) use some Trigonometry, together with the fact that a, = 2”/2 — by. 
Finally, for the second part of (c), compute 6), write a, = 26, - weet and 
apply the fundamental trigonometric limit. 

Start by letting x = 1 in (a) to get f(f(y)) = yf(1) for every y > 0. Then, 
use such a relation to show that f is injective and, thus, that f(1) = 1 and 
S(f(@)) = x for every x > 0. Ifa > 0is a fixed point of f, show that + also is, so 
that we can assume a > 1. Conclude that a‘ is a fixed point of f for every k € N, 
and use (b) to show that a = 1. Finally, make x = y in (a) to get f(x) = + for 
every x > 0. 
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1. For item (a), observe that £2-8G0 = ¢. LWfO0) For item (b), Bishi) — 
XxX—-X0 x—-x0 X—-X0 
fe)=F00) 


X—XQ 


—2(22 F(x 
2. For x in the domain of g, we have sw-e(2) =c: Ele) fo) Now, making 
x— CxX—X0 
y = y(x) = cx, we have y > xo if x > *, so that 
x) -— g(* = 
tim 28) _ 6 jig LO=LOO) _ org), 
xo X-F y>% VY — XO 
3. Make induction on k. 
4. For arcsin, take x € (—1, 1) and let y € (—4, 4) be such that siny = x. Since 


sin’ y = cosy 0, it follows from Theorem 9.28 (with the roles of x and y 
changed) that arcsin is differentiable at x, with 


1 1 
sin'y cosy J/J—sinty VI—x2 


The reasoning for arccos is completely analogous and will be left to the reader. 

. Use the results of the three previous problems. 

6. For the second part of (a), note that f’(1) = n and g’(1) = m, so that f’(1) = 
g’(1) only if m = n. For item (b), we have f(xo) = f-! (xo) = yo and f’(xo) = 
(f—') (xo) (since the tangent lines to the graphs of f and f~! at A must coincide). 
Now, apply Theorem 9.28 to conclude that f’(xo)f’(vo) = 1. 

7. Express Newton’ quotient of x +> f(x) at xo in terms of Newton’s quotient of f 
at Xo. 

8. Start by writing 


f (bn) — (an) = fbn) — fo) . bn — Xo di f (Xo) —f(Gn) _ 0 — An 


Dn — an Dn — x0 Dn —an X0 — An Dy —4n 


Nn 


Then, use the triangle inequality to get 


fbn) =f (Gn) 29)\z F(bn) — fo) -1| bn = Xo 
Dn — an Dy — X0 Dn — an 
ait oe —f (Qn) -1| . X0 — an 
X0 — an Dy — an 
< ed =Feo) _ 1 , |feod-faw) _ || 
Dn — Xo X0 — An 
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For item (a), note that the tangent line to the graph of f at (a,,f(a,)) has 
equation y—f(a,) = f’ (an) (x—a,). For (b), make f(x) = x*—a and use the result 


of item (b) of Problem 1 to conclude that dy4, = dp — (=) =i (a, + +), 
For item (b), write Newton’s quotient of f in xp as 


VR? — 3x2 — ,/R? — x5 =. i (R2 — x2) — (R2 — x2) 


0 
xX — Xo /R2 — x2 + [R? — xi 
xX+ Xo 


then, make x — xg. For (c), let r be the straight line that passes through A and 
is perpendicular to OA. Elementary Analytic Geometry (cf. Chap. 6 of [4], for 


instance) teaches us that the slope of r equals —i , where m is the slope of OA. 


Go) 


Since m = £2. it follows that the slope of r amie Now, (b) gives 


Foo" 


= = Ge, as we wished to show. 

For the first part, since (xo, yo) belongs to the graph of f, we must have yo = 
J (xo). Now, since r is tangent to the graph of f at (xo, yo), its slope must be equal 
to f’(xo); on the other hand, since r passes through (a, b), its slope must be also 
equal to yore For what is left to do, if a straight line passes through (1, —1) 
and is tangent to the given parabola at (xo, yo), it follows from the first part and 


from Problem | that 


2 
“0 


Yo = ; 
ee 1) = 300-1) 


2 
Hence, “0 + 1 = 2(xo — 1), so that x9 = 1+ V5 and yo = aes 
Choose a cartesian coordinate system such that F(0, f) and d is the straight 
line y = —§. Since Q(x,y) € P if and only if FO = dist(Q;d), a simple 


computation gives y = = as the equation of the parabola. Now, letting 


2p 
P(xo, — fy and Q(x, + x), use the result of the previous problem to conclude that 


> 2p 


PQ is tangent to P if and only if 


x p 
4 (1 2\_ 7 CT) 
dx \ 2p x— X09 
or, which is the same (cf. Problem 1), x? — 2x9x — p” = 0. Letting a and B 
be the roots of this second degree equation, we have A(q, ©) e B(B, 5 B ip) On 


a 
2p 2 a a =p 


=o SS oe however, since 


the other hand, the slope of AF ism = 
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2 2 : 2x x 
a~ — 2x9a — p~ = 0, it comes that m = ae = = Analogously, the slope of 


BF also equals = so that AF = BF, andA, B and F are collinear. 
13. Since Vk ¢ Q, we have f(k) = 0. Now, letting x be an irrational number 
different from Vk, we have a = 0. It thus suffices to show that 


him £2 LW 
im ———_ 


= 0. 
x Jk x= Vk 
xEQ 


To this end, let x = - with p,q € N, be an irreducible fraction. We have 
a d 
piq—-Vel Pip—avel 
k k 1 
_ ptavk a Ptah 1 (2 va), 
q\4 


— Plp- Pk 


f@) —f(Wk) 
x— Jk 


Given € > 0, take n € N such that n > aver and let 
A= {p/a:p.qEN,0<q <n}. 
Note that A is a finite set. Letting 
6 = min{1, {|x— Vk|; x € A}}, 
we have [2 — Jk| < 6 > q =n, whereas : < J/k + 1 implies 


1 27k +1 
=1(2+ ve) <4 <. 
q \4q n 


ff) —f(Vk) 
x— Jk 


Section 9.4 


1. Apply the formulas for differentiation of sums, products and quotients, in 
conjunction with the computations of derivatives performed in this and in the 
previous section. 

2. For x € (0, +00), let h(x) = 2/x, so that h is differentiable by Example 9.29, 
with h(x) = dass, Since g(x) = (ho f)(x) and f is differentiable at 
Xo, the chain rule guarantees that g is differentiable at x9, with g’(xo) = 


WF (x0) )f’ (x0) = 4F x0) * fo). 
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3. Apply the differentiation formulas for the sums, product and quotient of two 
functions, together with the derivative computations of this and the last section. 
For items (e) and (f), start by reviewing the results of Problem 4, page 300. 

4. Ifr = 4, with m,n € Z*, then f(x) = x". For x > 0, let g(x) = 4%, so that 
g is differentiable (by Example 9.29). Since x +> x” is also differentiable (for 
x # Oifm < 0) andf(x) = g(x”), the chain rule guarantees the differentiability 
of f, with f’(x) = g’(x”) - mx"!. Then, again by Example 9.29, we have 


f@= a (ap ee a goer =. pel oe 
n n 


5. For the first part, differentiate both sides of (9.22) and, then, divide both sides 
of the resulting equality by f(x). For the second part, differentiate both sides of 
the equality f(x) = a(x — a1)(x — a2)... (x — @,) and, then, divide both sides 
of the resulting equality by f(x). 

6. Straightforward computations give f’(x) = —7(x — 3)~*, f(x) = 14(«— 3)°3, 
fO() = —42(e — 3), FO) = 168(x — 3) and f© (x) = —840(x — 3). 
Hence, f°) (0) = —840- 37° = —$8. 

7. Set fi(x) = /x+ | and, for an integer n > 2, 


fix) = xt yxte t+ yxt x+1 


with n square root signs. For n > 2, we have fy,(x)? = x + fr—1(x). Hence, 
it follows from the chain rule that 2f,(0)f/(0) = 1 + f/_,(0) or, which is the 
same, 2f/(0) = 1+/f/_, (0) (since f,,(0) = 1). Now, since f/(0) = 4, we can use 


the previous recurrence relation nine times to get f/)(0) = we. Alternatively, 
observing that f7(0)-—1 = 5 ”_,(0)—1), we get f/(0)-—1 = ser (f{(0) -l= 


-+ and, thus, f7(0) = 1 — x 


8. Make induction on k. For the induction step, differentiate x b> fhe using the 
formula for differentiation of a quotient, together with the chain rule. 
9. Suppose that the straight line y = ax + b is tangent to the given parabolas at the 


points with abscissas a e 8, respectively. Then, a7 = awa+b and 2+ (B—3)* = 
aB +b, whilea = £x° ba 2a anda = 4x? get (B —3)?) = 2(B—3). 


Hence, a = B—3 and b = a*—2a-a = —a*, b = 2+ (B—3)*—2(B—3)-B = 
—f* + 11, so that a? = B? — 11. Solving the system of equations in w and B 


thus obtained, we get a = 7 and Bp = 2. From this, we easily find a = z and 


1 oo: 2 1 
b= —3, So that the common tangent line is y = gx — 5: 
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10. For item (b), solve the system formed by the equations + +? ys = land + - 


a a’ 


a = 1, concluding from (a) that it has exactly four distinct solutions. For 
(c), analyse the case in which P(x, yo), with xo, yo > O (the remaining cases 
can be dealt with in analogous ways). Show (using either the chain rule, implicit 
differentiation or directly the result of oe 2) that the slopes of the tangents 


to € and H at P are wut equal to —3 - a and uv . en Finally, conclude 


that it suffices to show that 2 at a : 74 = |, and use the result of (b) to verify that 
0 


such an equality is indeed true. 

11. For item (b), take g to be equal to the sum of the functions x > f(x—x,), where 
f is as in item (a). 

12. Adapt, to the present case, the discussion of Example 9.41. 

13. Firstly, observe that a, + 2a. + --- + na, = f’(0). Then, use the fact that 
If(x)| < |sinx|, together with f(0) = 0 and the fundamental trigonometric 
limit, to get |f’(0)| < 1. 

14. Letting p > 0 be the period of f, show that f (p) = f(0) for every integer 
k => 0. Then, looking at these n equalities as a linear system of equations in the 
unknowns cos(jp), | <j <n, conclude that its only solution is cos(jp) = 1 for 
LAjsn. 

15. Firstly, verify that, for x € (a,b), the discussion that precedes Lemma 9.36 
allows us to write 


fe) _f@ts 
g(x) _gi(a) + 


—a 


with lim,+, re and lim,-+, w= 0. 


Section 9.5 


1. Let gh : R — R be given by g(x) = f(x) — f(0)cosx and h(x) 
g(x) = f’(x) + f(0) sinx. Verify that h’(x) = —g(x), and apply the result 
of Example 9.48. 

2. Letg: R > R be given by g(x) = f(z): Verify that g’(x) + g(x) = 0, and 
apply the result of the previous problem. 
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3. 


13. 


14. 


15. 


. Apply l Hépital’s rule for =. 


If g(x) = f” (x0) (x — xo), use Corollary 9.47 to conclude that f’ — g is constant 
and, hence, that f’(x) = f’(xo) + f” (xo)(x — xo) for every x € J. Then, put 
h(x) = f’ (x0) (x — x0) + LOO ( — xo)” and use a similar reasoning. 


. Use the given inequality to conclude that f is differentiable, with f’(x) = 0 for 


every x € R. Then, use Corollary 9.47. 


. Suppose first that c is the left end of J. Apply Lagrange’s MVT to an interval 


[c,x] C J to conclude that f{ (c) = lim,.4 foe & =i © exists and equals /. Then, 


do the same if c is the ne end or an interior point of I. 


. Apply Rélle’s theorem to @x + 4x? _ atl 
. Use the IVT to conclude that f ae — xsinx — cosx has at least two real 


roots, one in the interval (—4, 0) and aa in the interval (0, 4). Then, notice 
that f’ has a single zero, and apply RGélle’s theorem to conclude that f(x) = 0 
cannot have a third real root. 


. Apply Lagrange’s MVT. 
. In each of the items, apply I’ Hdépital’s rule for 2. 


0 
0 


. Apply P Hépital’s rule for ° twice. 
. For item (a), use Lagrange’s MVT twice to get f(x) —(x—a)f’(x) = —f" (€)(x—- 


a)’, for some & € (a,x); then, use the continuity of f”, together with the fact 
that f” (a) # 0, to guarantee the existence of c € (a,b) such that f” 4 0 in 


[a,c). For (b), let g(x) = me and use |’ H6pital’s rule twice to get 
_ 2f'(a) 


lim,+a¢ g(x) = f(a)’ 
Apply the ordinary I’ Hépital’s rule to F, G: (0, 7) — R, given by F(x) = f(4) 
and G(x) = g(t). 


Firstly, observe that HO) _ 4 (xf (x)) = f(x) +2f’ (x). Now, our hypotheses 
ax* 


and the extension of Seana 9.7 to limits at infinity guarantee that 
lim, +00 (f(x) + xf’(x)) does exist. Hence, by the version of |’ H6pital’s rule 
given by Problem 13, we have 


a 
clin £0) = tim, T= tim BES = in 0) +96) 
= lim f(x) + lim (f'@)). 


Therefore, lim, +00 (af (x)) = 0. 
Suppose, without loss of generality, that lim,+, g(x) = +-oo. Given € > 0, we 


want to find 0 < 6 < b—asuchthata <x <a+5>53 fe 


-1 < €. 


Take 0 < 7 < b 


_ £. i 
TQ) L Letting 
f@-feo) _ fc) 
8(x)—g(x0) 8’ (cx) 
<L ae = and, hence, 


for 


xo = a+ 7 and taking a < x < xo, show that one has 


f@)—fo) 
g(x)—8(xo) 


some cy € (x, Xo). Then, conclude that L — 5 < 
that 
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16. 
17. 


18. 


19. 
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2 


é Aveta, 76a Fe) : use. #6) 
B55) ay cay ey a a) es 


Finally, use the fact that g(x) > +00 as x > a to find 0 < 6 < n such that, for 
a<x<a+t+46, 


€ 


a 


(9 - 
27 g(x) — g(x) 


Justa) $8) 
|(e+ a) ea ee) 


Use Lagrange’s MVT, together with the result of Problem 11, page 263. 

Take a sequence (x,),>1 in J \ {xo} such that x, > xo. If x, > xo (resp. Xn < Xo), 
then, applying Lagrange’s MVT to the interval [xo, x,] (resp. [xn,xo0]), conclude 
that there exists y, € (x0, Xn) (resp. Yn © (Xn, X0)) Such that 


_ fin) = fo). 


Xn — X0 


f' On) (B.5) 


Now, since y, — Xo and y, # Xo, we have f’(y,) — L. Therefore, it follows 
from (B.5) that a — L. The above reasoning has shown that 


fn) = FG0) _ 


Xn — XO 


XX, > XxX => L. 


f@)=f x0) 


Argue by contradiction to show that this suffices to prove that lim,-+, = 


does exist and equals L. 

If such an f exists, start by writing the given relation as Bet < ae 
Conclude that f’(x) < —1 for every x > 0 and, then, use Lagrange’s MVT to 
show that f(x + 1) < f(x) — 1 for every x > 0. Finally, use this last inequality 
to show that f should take negative values, which is not the case. 

Start by using Lagrange’s MVT to show that there exists 0 < 6 < 1 for which 
If (x)| < |x| whenever |x| < 6. Then, observe that |f’(x)| < |x| for |x| < 6, and 
use Lagrange’s MVT once more to obtain |f(x)| < |x|? for |x| < 6. Iterate this 
reasoning, thus showing (with the same 5 > 0) that [f(x)| < |x|” for |x| < 6 
and every n € N. From this, conclude that f(x) = 0 for |x| < 6. Extend the 
argument above to show that, if f(xo) = 0, then there exists 6 > 0 such that 
f(x) = 0 for |x — x0| < 6. Finally, invoke the result of Problem 5, page 272, to 
get f(x) = 0 foreveryx € R. 


Section 9.6 


1. 


Consider the quadratic f(x) = ax* + bx + c, with a > 0 (the case a < 0 is 
completely analogous). Since f’(x) = 2ax + b and a > 0, we have f’(x) < 0 
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if and only if x < ~ 34 and f’(x) > 0 if and only if x > —3. Therefore, f 


decreases in (—oo, <a and increases in (-Z, +00), so pe f atts a global 
minimum at x = -£ ~ (and only there), with f (-£)= = -A 


2. In all of the items, spol the first derivative test in comminceen with the study of 
16—3x? 
24/x(x?2-+ 16)? 
0. Therefore, x = az is the only critical point of f, which increases in [0 


the first variation. For instance, in item (a) note that f’(x) = for x > 


4] 
and decreases in [ ies Hence, x = a is the only global maximum point 
of f, with f() = = aa Now, observe that f(0) = 0 and f(x) > 0 for x > 0, 
so that x = 0 is the only global minimum point of f. The remaining items can 
be dealt with likewise. 

3. First of all, note that f has no points of global maximum or minimum, since 
(cf. Example 9.17) lim,—+—o5 f(x) = —oo and limy-++40 f(x) = +00. Now, we 
have f’(x) = 3x* + 2ax + b, with A = 4(a? — 3b). If A < 0, then f’(x) > 0 
for every x € R, so that f increases along all oF the real line. If A = 0, then 
f(x) =0 4 x = —$ and f(x) > 0 for x # —$; hence, — is the only critical 
point of f, which still increases along all of the ial line. Finally, if A > 0, then 
f’ has exactly two real roots, x; < x2; moreover, f’(x) > 0 forx < x, orx > x2, 
while f’(x) < 0 for x, < x < x; therefore, f increases in (—oo, x;], decreases 
in [x1, x2] and increases again in [x2, +00). Thus, f attains a local maximum at 
x, and a local minimum at x2. 

4. First, let f : [0, +00) — R be given by f(x) = sinx — x. Then, f’(x) = 
cosx — 1 < 0, so that f is nonincreasing in [0, +00). Hence, for x > 0 we have 
f(x) < f(0) = 0. Now, let f : [0, +00) — R be given by f(x) = cosx—1+ z 
By the first part, we have f’(x) = —sinx + x > 0 for every x > 0, so that 
f is nondecreasing in [0,-+00); thus, for x > 0 we have f(x) > f(0) = 
Finally, let f : [0, +00) > R be given by f(x) = sinx —x + x“ and note that 
f'(@) = cosx— 1+ © > 0; this gives f(x) > f() = 0 for x > 0. 

5. Verify that als DOR, so that DR = x(2 — x) if AO = x. Then, compute 
PO = Vx? +1 and OR = (2 —x)Vx? +1, so that POR has area equal to 
5 PO- OR = te x)(x? +1). Finally, study the first variation of f : [0,2] > R, 
given by f(x) = (2—x)(x* + 1), to conclude that it attains a maximum only for 
— Zee 7 

6. Let Aa, f (@)) and A’(x, f(x)), so that (thanks to elementary Analytic Geom- 
etry) a is a point of global minimum for d : (a,b) — R given by d(x) = 
(x — xo)? + (f(x) — yo)?. By the test of first derivative, we have 0 = d'(a) = 
2(a — xo) + 2(f(a) — yo)f’(@), so that either @ = xo and f’(a) = 0 (since 
P ¢ Gy), or a # Xo and f’ (a) (Hs =) = —1. Now, use the fact that the 
product of the slopes of two perpendicular lines, none of which being vertical, 
equals —1. 


7. Just notice that, from the previous problem, AB is perpendicular to the tangents 
to the graph of f at A and to the graph of g at B. 
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8. 


10. 


11. 


12. 
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If A(a, a”) and B(b, 1 — (b— 3)?), then AB = (a—b)? + (2 -1+ (b-3)?). 
Condition f’(a) = g'(b) gives us b = 3 — a, so that AB’ = 4a‘ — 12a + 
10; hence, we are left to minimize h(x) = 4x* — 12x + 10. The test of the 
first derivative gives a = ¥/3/4 as the only minimum point of h, and an easy 


computation gives AB = 4/10 — 94/3/4. 


. Let AH = h, BH = 1, HAB = a and BC = 0. Apply some Trigonometry to 


tan(a+6)—tana _ —2nl 


get tan@ = I-+tan(a+6)-tana ~ 7+32° 


Now, if f() = oie for / > 0, compute 


(h?-+317)? 
global maximum. 
Let 6 = BAD = ADC, so thatO0 < @ < oe AD = a+ 2bcos@ andh = 
bsin 6, where h is the height of ABCD. Hence, A(ABCD) = 5(AD + BC)h= 
(a + bcos@)bsin 6, and it suffices to maximize f : (0,4) — R given by 
f(@) = (a+ bcos @) sin @. To what is left to do, use some Trigonometry to 
get f’(0) = 2bcos? 6 + acos 6 — b and f” (8) = —2b sin(26) — ab sin 6. Now, 


"7 4 = 2b 
check that f’”(6) < 0 for every 6 € (0, 4) and that 6 = arccos cree ra Teas a] 


ro 2h(h2—3/) me bY ah ‘ : 
I’ = =~ and conclude that the critical point / = A of f is a point of 


is the only critical point of f. Finally, apply Corollary 9.60 to —f. 


In the notations of Example 9.61, we have PX- PY = eae 


OP sina sin B is constant, we conclude that PX - PY is minimum if and only 
if f(@) = sin @ sin(a + B + @) is maximum, where f : (0,7 —a — B) > R.In 
turn, the first derivative test assures that if a global maximum point for f exists 
at all, it should be a critical point of f. However, since 


Since 


f' (@) = cos 6 sin(a + B + 6) + sin@ cos(a + B + 8) = sin(a + B + 26), 


it follows that 6 is critical if and only if @ + 6 + 20 = mz or, which is the 
same, if and only if 9 = 7 —a — B — 0,iie., if and only if OXY is isosceles 
with basis XY. It now suffices to show that, OXY being isosceles with basis 
XY (or, which is the same, letting 6 = $(x — a — B) — which is the only 
critical point of f), then f does attain its maximum possible value. To this end, 
just note that if0 < 6 < (x —a — B) then 0 <a+ 8+ 26 < a, so that 
f' (0) = sin(aw+f8+26) > 0; on the other hand, if }(7—a—B) <6 <1-a-B, 
then <a+ 8+ 26 < 27, so that f’(0) = sin(a + B + 20) < 0. Therefore, 
it follows from Proposition 9.57 that f is increasing in (0, 5 (x — a — B)) and 
decreasing in (3 (x —a-— 8), —a—f), so that 6 = 3 (x —a —§) is, indeed, 
the only global maximum point of f. 
In the notations of Fig. B.6, we shall prove that the position of P along r that 
minimizes AP + BP is such that a = f. 

To this end, start by observing that, had we drawn the point P to the left of 


A, then we would have a > 3; accordingly, had we drawn P to the right of B, 


we would have B > 5. Therefore, a, B € (0, 7). 
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Fig. B.6 Minimizing AP + BP 


13. 


If P = A’, Pythagoras’ theorem gives AP-+ PB = a+ Jb? + c?; if P = B’, 
we analogously have AP+ PB = b+ Ja? + c?. If P 4 A’, B’, then, computing 
the sines of a and f in the right triangles APA’ and BPB’, we get (even if a or 
is obtuse) AP + PB = “ + > Also, it follows from A’P + B’P = c that 


sina sin B* 


a b 


=C¢, B.6 
tana tanp c a 


and this is also valid even if @ or B is obtuse. 

Now, note that 6 continuously depends on a. On the other hand, a separate 
analysis of the cases of P to the left of A’, P between A’ and B’ or P to the right 
of B’, together with (B.6), show that 6 actually depends smoothly on a. Then, 
we just need to minimize d(@) = s4> + re with 6 being given by (B.6). 


Use the first derivative test to show that d’/(a) = —— = ane - B'(a), 
so that d’(w) = 0 if and only if B’(a) = —¢.- cosasi” 8 On the other hand, 


bcos B sin? w 


differentiating (B.6) with respect to w we get —5— + wa - B'(a) = 0, so that 


Ba) = -§- 8 Comparing both expressions above for 6’ (a), conclude that 
‘(a=0e a =lsa=fB. 


We are left to showing that the position of P at which a = 6 actually 
minimizes AP + PB. To this end, computing d"(a) (for a generic a) and 
recalling that a, B € (0, 7), we obtain d” (a) = al + cos? @) + aad + 
cos” B) > 0. Hence, Corollary 9.60 guarantees that the critical point of d is 
indeed a minimum global point, among all positions of P along r, as long as 
PH#A',B’. 

Finally, conclude that when w = f, then d(a) = Vc? + (a+ b)?. Finally, 
observe that \/c2 + (a+ b)? <a+ Jb? + c?,b + Va? + c? (which are the 
values of AP + PB when P = A’ or B’). 

Fix a cartesian system for which A(0, a) and B(/, b), and let P(x, 0), withx € R. 
Use basic Analytic Geometry to show that it suffices to prove that f : R > R, 
given by f(x) = (x7 + a*)((x — 1)? + b*), has a single minimum global point, 
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14. 


15. 


16. 


17. 


18. 


19. 
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provided / < /2(a? + b). Compute f(x) = x4 — 2b3 + (@ +b? + P)x* — 
2a? Ix + a?(b? + P), f' (x) = 40° — 6b? 4+ 2(a? + b? + P)x — 2071 and f’ (x) = 
12x? — 12/x + 2(a? + b? + 7). Then, show that f’ has a root in the interval (0, J) 
and examine under what conditions we have f” (x) > 0 for every x € R. 
Conclude that it suffices to show that f(x) = _ x > O, attains its 
minimum value at x = n. To this end, study the first variation of f. 

Firstly, show that y = + (kv 1+? -—x)+i(V1 F224 x). Then, study the 
first variation of f : R > R given by f(x) = x+y. 

Use the inequality between the arithmetic and geometric means of two reals 
to show iat the peat expression is greater that or equal to f(x + y), where 
f®=t+2 es z fort > 0. Then, study the first variation of f. 


Letting a = sin $ # and Bp = sin® a show that it suffices to analyse the equality 


f(a) = f(B), where f(x) = a Tay? for x € (0, 1). To this end, study the first 
variation of f. 

Use the MVT to guarantee the existence of c € (a,b) such that f’(c) = 1. If 
f(c) = c, it suffices to apply the MVT twice. If f(c) < c, use the MVT to assure 
the existence of a € (0,c) and B € (c, 1) such that f’(a) < 1 and f’(B) > 1; 
then, apply Darboux’s theorem. If f(c) > c, argue alalocously: 

Write f(x) —f’@) —f"@) +f") = F@) —f'@) — £6 @) —f" (0). Then, 
apply the following claim twice: if g is a polynomial fincuos of positive degree, 
such that g(x) — g(x) => 0 for every x € R, then g(x) > 0 for every x € R. In 
order to prove this claim, start by using the result of Example 9.17 to show that 
g has even degree and positive leading coefficient. Then, take (by the result of 
Problem 10, page 263) x9 € R such that g attains its minimum at xo and apply 
the first derivative test. 


Section 9.7 


1. 


2 


For a < bin/J and 0 < t < 1, it follows from the strictly convex character of g 
that g((1 —t)a+ tb) < (1—J‘)g(a) + tg(b). Now, since f is increasing, we have 
(fo g)((1— fa + tb) = f(e((1 — fa + tb) < f((1 — g(a) + tg(6)). Finally, 
use the fact that f is also strictly convex to show that the same is true of f 0 g. 
Firstly, Theorem 8.35 shows that f is either increasing or decreasing. Suppose 
that f is increasing (the other case in totally analogous). For a < b in J and 
0<t< 1, we have f((1 —ft)a+ th) < (1 —af(a) + #f(b). Let a = f(a) and 
B = f(b), so that a = f—!(a) and b = f—!(B). Applying f—! to both sides of 
the above inequality and invoking the result of Problem 8, page 176, show that 
(1 —af7!(@) + tf !(B) < f-'( — De + #8). From this, conclude that f~! is 
strictly concave. 


. Use the definition of strictly convex (resp. concave) function, along the lines of 


the hints given to the two previous problems. 
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4. Write f(x) = “SS = -Vl-x+ 7 and show that each one of the 
functions t  —./t and t a t € (0,1), is strictly convex. Then, use the 


4—x 
mae 


result of the previous problem. Alternatively, compute f”(x) = 
solve the inequality f” (x) > 0 (resp. f” (x) < 0). 

5. Since f(x) = 6x + 2a, we have f” > 0 in (—$, +00) and f” < 0 in (—oo, —). 
Hence, f is eweny convex in (—4, +00) and strictly concave in (—oo, —$), so 
that x = —¢ is its only inflection point. 

6. Computing derivative, we get f”(x) = (2-4 =z) sin + = 2 cos 1, Now, ifn ¢ N 
and x, = -, then 4 = = nm, so that f(t) = re be: Ga = = (-1)""!2nz. 
Therefore, "on (xx) f”" (Xe-41) < 0, and the IVT guarantees the existence of y, € 
(X%n+1,Xn) such that f”(y,) = 0 (notice that we need not invoke Darboux’s 
Theorem 9.62, since f” is continuous in (0, 7): Verify that y, is an inflection 
point of f. 

7. Note that g is also twice differentiable, with g” (x) = 2f’(x) +xf” (x) = 0. Then, 
apply item (a) of Theorem 9.69. 

8. Suppose that f is convex. Give x, y € J, let us first show that f((1 —1)x + fy)) < 
(1—2)f (x) +460) (*) for every dyadic rational number t € [0, 1] (cf. Problem 5, 
page 206); to this end, oe s make menceay on k > 1. Fork = 1, we have the 
validity of (*) for t = 0, 5 + and 1. Fort = 3, applying the convexity of f twice 
we get 


4 2 2 


(2) -(22). t() +/0) 


f@+fO) 
< tt) sf )+ “F0). 


as wished. Finally, for t = i it suffices to interchange x and y in the reasoning 
above. Now, by the sake of induction hypothesis, let (*) be valid for a certain 
k € N, all x,y € J and every integer 0 < n < 2*. Fort = ai where 
0 < m < 2**! is an integer, let’s distinguish two distinct subcases: (i) m is 
even, say m = 2n: thent = ;, and the validity of (*) follows from the induction 
hypothesis. (ii) mis odd, say m = 2n + 1: then 


_m 1 m—1 m+1 a. n n+1 
= pet > 3 ger + per) 3a oe) 


Letting s = > andu = ue, we have s,u € [0, 1] and t = 5(s + u). Hence, it 


follows from the convexity of f and the induction hypothesis that 


612 


10. 


11. 


12. 
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f(x +09) =s((1-"*) s+ (=*)») 


(< = s)x + sy) + (= wx + ~) 


_ f= sx + sy) +f — wx + wy) 

~ 2 

1 

3d — s)f(x) + sf(y)) + (A — Wf) + uf))] 
S+u stu 

= (: = ) 109 ar (ro 

= (1— df) + #0). 


= 


Now, given x, y € J, the continuity of f assures that of g : [0, 1] > R, such that 
g(t) = A-d)f@) +40) -f(A—)x+p)) fort € [0, 1]. On the other hand, we 
have shown above that g(t) > 0 for every dyadic rational number ft € [0, 1], so 
that the density of them in [0, 1], with the help of Problem 13, page 264, assures 
that g(t) > 0 for every ¢ € [0, 1]. 


. Suppose f is strictly convex, fixa < binJand0<t<1l1.Ifec=(1—-—na+b, 


then a < c < b, so that we can choose a € (a,c) and B € (c,b) satisfying 
c= ae Moreover, letting a = (1 — s)a+ sb and B = (1 —u)a + ub, with 
s,u € (0,1), show that c = ae >t= stu Now, use the given condition 
together with the definition of strict convexity and the result of the previous 
problem, to obtain f((1 — t)a + th) < LO4H8) < (| —ng(a) + f(b). 


Use the hypotheses on f to show that, given positive and distinct reals x and 
rx) Ly f(y Q4y2 ; : 
y, we have wate) > (=) > £2), with at least one of these 
y x+y 2 


inequalities being a strict one. From this, deduce that seine > g (2) and 
apply the result of the previous problem. 

For the function of item (a), use the inequality of item (b) of Example 5.1 to 
show that (442) < “9° for all a, b > 0, with equality if and only if a = b. 
For item (b), use induction on n, together with the fact that f is increasing, to 
show that (4°) < au for all a,b > 0, with equality if and only if a = b. 
For (c), use Trigonometry to show that, on (0,7), we have sina + sinb < 
2 sin (44), with equality if and only if a = b. For item (d), argue as in (c), 
using Trigonometry to show that 2 tan(4t?) < tana + tanb for a,b € (0, 4), 
with equality if and only if a = b. 

If x = xo, there is nothing to do. Otherwise, Lagrange’s MVT (applied twice) 
gives a real c between xo and x such that f(x) — f(xo) = f’(c)(x — xo), and d 


between c and xo, such that f’(c) — f’ (xo) = f” (d)(c — xo). Hence, 
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FQ) — fo) —f' G0) — x0) =f (oe — x0) — f’ Ho) (% — Xo) 
= (f'(c) — f' (0) (x — x0) 
= f" (d)(c — x0) (x — 0). 


Now, since f is convex and twice differentiable, it follows from Corollary 9.70 
that f” (d) => 0. Then, taking into account that (c — xo)(x — x9) > 0, the above 
computations give 


FQ) — fo) —f’ 0) — x0) = 0. 


13. Suppose, to the contrary, that there exist natural numbers n; < no < n3 <-::: 
such that (f(7,))x>1 is an arithmetic progression. Use the strict convexity and 
the increasing character of f to successively get, for k > 1, 


fim) > ¢ ("4 ) 


and 2ng > np—1 + nx+1. Finally, arrive at a contradiction. 

14. Firstly, note that f : (0, +oo) > R, given by f(x) = x‘, is strictly convex. Now, 
apply Jensen’s inequality (9.31). 

15. For item (b), note first that (a) allows us to take O in the interior of AyA2... Ap. 
In turn, letting a; = A,OAi+1 for 1 < i < n (with A,4; = Aj), this gives 
0 < a; < a for every i. Finally, adapt to the present case the reasoning of the 
solution of Example 9.78. 

16. Letting a; = A,OAz44 for 1 < i < n (with A;4, = Aj), we have 0 < a; < 
7. Now, show that the perimeter of AjA2...A, equals r }*;_, tan ¢. Finally, 
observe that tan : (0, 9) — Ris strictly convex and apply (9.31). 

17. Sine Law givesa+b+c= 2R(sinA + sinB + sin C). Now, apply Jensen’s 
inequality (9.31) to sin: (0,7) > R. 

18. For item (a), use the fact that A(ABC) = A(ABP) + A(ACP) + A(BCP). For 
(b), show that f : (0,4) — R given by f(x) = i is strictly convex and apply 
Jensen’s inequality to get f(h) > 3, with equality if and only if P is the center 
of ABC. 

19. For the first inequality, apply (9.31), together with the result of Problem 4, page 
337. For the second, apply (9.31) again, this time using the fact that the square 
root function is strictly concave in (0, +00). 


Section 9.8 


1. Since sinx = cos(> — x), we have csc x = sec(> — x). Now, use the discussion 
of Example 9.79, together with the results of Problem 10, page 194. 
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2. We sketch hints to some of the is letting he other ones to the reader. For 
item (b), compute f’(x) = 1 — + and f” (x) = 5 to conclude that f decreases 
in (—1, 1), increases in each of (—oo,—1) and rio +oo0), is strictly convex in 
(0, +00) and strictly concave in (—oo,0). Also, use the result of Problem 11, 
page 291, to conclude be y = x is the only oblique asymptote of f. For item 


20 -= 
(f), compute f’(x) = wae =a and f”(x) = aie Ta to conclude that x = +1 are 


the only critical points of f, while x = 0 and x = +¥V3 are its only inflection 
points. Also, f’(x) > 0 if and only if |x| < 1, so that f increases along the 
interval (—1, 1) and decreases along each of (—oo, —1) and (1, +00); therefore, 


x = —1 is a local minimum point and x = 1 is a local maximum point of f. 
However, since f(—1) = —4,f() = ; and lim),)-++00 f(x) = 0, it follows that 
x = —1 is the only global minimum point of f, while x = 1 is its only global 


maximum point. Now, at the inflection point x = 0, the tangent to the graph is 
(trigonometrically) at | from the horizontal axis, while at the inflection points 
x = +43 such an angle equals arctan (—4). Note that f”(x) > 0 if and only 
ifx € (-V3, 0) U (73, +oo); therefore, f is strictly convex in each of these 
intervals. Accordingly, f is strictly concave in each of the intervals (—oo, — V3) 
and (0, /3). Finally, the horizontal axis is the only horizontal asymptote of the 


graph. For item (h), compute f’ (x) = 3 and f” (x) = ake to conclude 

that f decreases along all of its domain, is strictly convex in each of the intervals 

(—1, 0) and (1, +00), and strictly concave in each of (—oo, —1) and (0, 1). Also, 

= 0 is its only inflection point, in which the tangent to the graph makes a 

trigonometric angle of —4 with the horizontal axis. Finally, this axis is the only 

horizontal asymptote of f, with lim,_,4.0 f(x) = limy+—oo f(x) = 0; lines x = 

—1 and x = 1 are the only vertical asymptotes, with lim,_,(_1)_ f(x) = —oo, 
lim, (—-1)+ f(x) = +00, lim,.;— f(x) = —oo and lim,_,14 f(x) = +00. 

3. First of all, note that since f’(xo) = g’(xo), line r is tangent to the graph of g 


(hence, to I’) at A; thus, OA_Lr. For item (a), if f’(x9) = 0, then r is horizontal 
and, then, OA is vertical. If f’(xo) 4 0, basic Analytic Geometry assures that 


the slope of OA equals ci For (c), it follows from (x — a)? + (g(x) — 
B)? = R’ and the chain rule that 2(x — @) + 2(g(x) — B)g’(x) = 0; therefore, 
2 + 2¢'(x)? + 2(g(x) — B)g’(x) = 0. Now, substitute x = xo in the first equality 
and use the result of the second part of (a) to get the first equality of (c); then, 
substitute x = xg in the second equality and use the fact that g(x) = f(xo) 
and g'(xo) = f’(xo) to get the second equality of (c). For item (e), observe that 

= (xo — a)” + (f(xo) — B)? and substitute the values of w and B computed 
at (d). Finally, for the second part of (f), notice that condition f’(xo) = g’ (xo) 


guarantees that rL AO. 

4. For item (a), note that f is even. For (b), compute f’(x) = sin + _ i cos 1. For (c), 
observe that sin y — ycos y = 0 implies cos y ¥ 0 and, then, tan y = y. Now, use 
the IVT (sketching the graphs of y +> tany and y b> y may help) to guarantee 
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that the set of reals y > O such that tany = y can be put in a sequence (y,),>1, 
satisfying the given conditions. For item (d), note that 


2kat <y< yx, => tany < y=>siny—ycosy <0 


and, hence, that f’(x) < 0 for x € (xx, ); then, use a similar reasoning to 


1 
show that f’(x) > 0 for x € ( Tate XK), iit Xo is a local maximum point of 
f. Then, argue analogously to conclude that x2,_; is a local minimum point for f. 
For item (f), initially observe that, since f’(x,) = sin a 7 cos + = 0, we get 
[f(%n)| = |x, sin | = | cos +| = | cos y,|; then, apply the last part of (c). For (f), 
conclude first that. f'@® > 0 for x > x1; then, apply the fundamental trigonomet- 
ric limit to get limy++o0 f(x) = limy+o4 oe = 1. For (g), start by computing 
f"(«) = -4 sin 1. Then, for x > 0, note that f’(x) > 0 (resp. f”(x) < 0) 
if and only if sin + < 0 (resp. sin + > 0, and solve such inequalities. Finally, for 
item (h), use the result of the previous items, also noticing that |f(x)| < |x|, with 
[f(x)| = |x| if and only if x = ee for some n € Z. Your drawing should be 
somewhat similar to the one shown in Fig. B.7). 


Fig. B.7 Sketching the graph of x +> xsin 4 
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1. Adapt the proof of S(f; Q) < S(f; P) to this case. You shall need to use the fact 
that inf...) f> infty xf > inf. x: 

2. Take a partition P of [a, b] and show that s(f; P) = S(f; P) = c(b— a). 

3. Read the proof of Theorem 10.5 again to check that _jim GS (f; Pk) — 


s(f;Px)) = 0. Then, use the fact that f is integrable, together with the 

inequalities s(f;Px) < i f(xjdx < S(f; Px), to show that the limits 
lim S(f;P,) and lim s(f; P,)) do exist and are equal to Sf? fQddx. 

k>+00 k—>+00 


4. Letting Py = {a = x) < x) < +--+ < x, = bD} be a uniform partition, we 
have A(f;k) = s(f; Px) and A(f: k) = S(f; P;), so that (10.6) gives s(f; Px) < 
la (1- i < S(f; Px). Now, make k > +00. 

5. For item (a), substitute u = x7, v = y? and write down the expressions involved 
in terms of x and y. For item (b), let u = x; and v = x;_; in (a) and add the 
inequalities thus obtained, observing that xj — xj) = pa Finally, for (c) let 
k — +00 in the inequalities of (b) and use the result of Problema 3. 

6. For 1 < k < n, write L.—L. = 1.—L.. Then, if f@) = —~ and 


Jn k-+n n fE+ 1" Vx+1 
P, = {0, 1, 2, wana ml 1}, note that a, = s(f;P,). Finally, use the result of 


Problem 3. 


7. Forl < k < n,letn- sty = }- aay Then, take f(x) = yin, 
P, = {0, 1, 2, = — 1} and note that a, = s(f; P,). Finally, use the result of 


Problem 3. 

8. For the first part, use the given inequality to show that Cauchy’s integrability 
criterion is satisfied. For the second part, conclude (also from the given 
inequality) that limp.+o0 S(f; Px) and limg-++4o0 S(f; Px) exist and are equal. 
Then, use the fact that s(f; Px) < Sf? fddx < S(f; P;) to finish the proof. 

9. For item (a), adapt (10.17) to the present case, noticing that, if Mj = f (x}) and 
m; = f (x7), then Mj — m; = f(x) =f@) < c|x; — x} < cj —4j-1) = ea) 
For item (b), start by showing that U(f; Px; €) and ihe f (x)dx both belong to the 
closed interval with endpoints s(f; P;,) and S(f; Px). 

10. For the first part of item (a), note that 2 sin(a + jh) sin A = cos(a+ (— 1)h)- 
cos(a + jh), add such equalities from j = 0 to j = k and transform the result 
thus obtained in product. For the second part, just note that cosx = sin (4 —x). 
For the first part of (b), Example 8.8 shows that the sine function is Lipschitz, 
with Lipschitz constant c = 1. Let Py = {a = x9 < x1 < +--+ < x, = b} be 
a uniform partition of [a, b] and (in the notations of the previous problem) take 


&; = x; forl <j <k. Letting h = boa = xj —xj-1 for 1 <j < k, note that 


k Asin (a + aut) sin erph 

D (sin; Py; &) = bY sin(a + jh) = ————_*> "~~ sina. 
sin 3 

j=l 2 


11. 
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Then, apply the fundamental trigonometric limit, together with the result of 


item (b) of the previous problem, to conclude that 


b hsin (a + a) sin V4 
i sinxdx = lim | ———-—————_—————__ — vi sina 
a 


h—>0 sin = 


th  (bt+a h\. [(b-a_h 
= 2 lim | —, :sin ——)]sin +— 
h>0] sin 5 2 2 2 2 


b— 
) sin (—) = cosa—cosb. 
2, 


b+ 
= 2sin( 


For the second part, adapt the above argument. 

Firstly, note that, by the density of the irrationals in the real line, we have 
s(f; P) = 0 for every partition P. Now, give € > 0, fix no € N such that 
no > 2. Then, if A = {x € [0,1]; x = 4, with m,n € N and n < no}, show 
that A is finite and use this fact to guarantee the existence of a partition Po of 
[0, 1], such that the sum of the lengths of the intervals of Pop which contain some 


element of A is less than 5. Finally, show that S(f; Po) < €. 


Section 10.3 


. Let f(x) = xsin(zx). If Py» = {0 = x9 < x1 < +--+ < x, = 1} is a uniform 


partition, show that a, = 7X (f; Pa; &), where € = (&,...,&,) is such that & = 
x;, for 1 < j < n. Now, apply Riemann’s theorem. 


. More generally, let A = {x1, x2,x3,...} C R be countably infinite. Given € > 0, 


let J; = (x - a) for j => 1 and note that ea iJ = 5 <e. 


. Givene > Oandj > 1, let 4y;, Ij, ... be open intervals such that A; C Uist I; and 


Dies yl < gat: Then, {I; i,j = 1} is countably infinite, Ujs1 Ai Cc Uist ff 
and Pasi [Zyl < §. 


. Let Df, Dz and Dyz be the sets of points of discontinuity of f, g and Dg. Show 


that Dy, C Dy U Dy and, then, apply Lebesgue’s theorem, together with the result 
of the previous problem. 


. LetQ = {0 = yo < yy < +--+ < yy = 1} bea partition of the codomain (0, 1] 


of f and, for 1 < j < k, let A; be the inverse imagem of [y;-1, y;); let also Ay be 
the inverse image of [yx—1, yx]. Show that A; = @ for 1 <j < k and recall that, 
since A, = QN (0, 1], we have m(A;) = 0. Then, use (10.23) to conclude that 


Sofe = 0. 


. Firstly, assume that f is increasing, and let Q = {0 = yo < y) < +--+ < y = L} 


be a partition of [0, L]. For 1 <j < k, let A; be the inverse image of [y;_1, yj); let 
also Ax be the inverse image of [yx—1, ye]. If x; = f~'(y;) for 1 < j < k, show 
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that Aj = [xj-1,xj) for 1 <j < k, and Ag = [xg—-1, xx]. Then, if P = {a = x9 < 
X1 < +++ < x, = b}, show that Jiao fo = s(f; P). Now, consider the case of 
a nondecreasing f, and observe that the case of a nonincreasing f is completely 
analogous. 

7. Look at both integrals as the areas of regions under the graphs of f and f—', and 
recall that the graph of f—! is symmetric to the graph of f with respect to the 
straight line y = x of the cartesian plane. Then, reflect the region under the graph 
of f—! along this line and show that the union of the region under the graph of f 
to this reflected one contains a rectangle with sides a and b. 

8. Start by showing that, given t € [0, 1], the fact that f is nondecreasing guarantees 
that A(Ry) is greater than or equal to the area of the rectangle of basis [f, 1] and 
height f(¢), 1.e., that 


1 
[ forae> aan 2 fo -# 


Then, fix s € [0, 1] and let t = g(s) in the above inequality to get f(g(s)) —g(s) < 
i f(x)dx. Finally, use the fact that s € [0, 1] was chosen arbitrarily to conclude 
that 


1 1 
[ ee -seas = [ pooae. 
0 0 


9. Let f : [a,b]  R be a bounded function for which limjp|_,9 &(f; P; €) = J, with 
I not depending on the choice of the intermediate points & of P. In order to show 
that f is integrable, given « > 0 we need to find a partition P of [a, b] such that 
S(f; P)—s(f; P) < €. To this end, start by taking Po such that &(f; Po; €)—1| < § 
for every choice of intermediate points &. If P = {a = x) <x) <...< x, = bd} 
and &, nj € [xj-1, xj], show that 


k 
y. 
> FE) —F (mp) 10q — 43-1) < ~ (B.7) 


j=l 


Now, let Mj = SUP ix, 12] fim = inf, .] f and choose sequences (&n),>1 and 


(Njn)n>=1 in [xj-1, xj] satisfying f (Ein) = M;, and f(njn) = m;. Make & = &y 
and 7; = nn in (B.7) to show that (My; — mj)(x; — xj-1) < x. 


Section 10.4 


1. Modify the function of item (a) of Problem 11, page 311, taking, for instance 
f@) = (x—a)*? sin +, if x € (a, D]. 


x—a’ 
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2. 


RW 


11. 


12. 


Making xo = a and x%41 = b, Proposition 10.21 gives 


/ , fQodx = y I ; fQodx. 


Now, observe that, in this last expression, the sum of the summands ‘A i é' Sf (x)dx 
xj 


for which f > 0 in (xj-1,j) corresponds to A(R ), whereas the sum of those 
summands for which f < 0 in (x1, xj) corresponds to —A(R; ). 


. The hypothesis gives g1(x) — fi (x) = g2(x) — fa(x) for every x € [a, b]. 
. Use Proposition 10.19, together with Problem 2, page 357, and Lemma 10.17. 
. For item (a) and for j € Z4, let f; : [7,7 + 1] — R be the restriction of the 


fractional part function to the interval [j,j + 1]. If P = {0 = x) <x, <---< 
Xx = 1} is a partition of the interval [0, 1] and y; = x; +n for 0 <j < k (so that 
QO = {n=yo <y) <-+++ < yy = n+ 1} isa partition of [n,n + 1]), show that 
S(fo; P) = s(fn; Q) and S(fo; P) = S(f,; Q). For item (b), use Proposition 10.19, 
together with Problem 4, page 358, and Lemma 10.17. 


. First of all, show that it suffices to consider the case f = 0. In this case, if 


there exists x9 € [a,b] such that g(xo) > 0, use Lemma 8.20 to show that 
iP ‘ g(x)dx > 0. 


. Conclude first that — f? |f(x)|dx < [? f(@dx < f? \f(x)|dx. Then, finish the 


proof. 


. Adapt, to this more general case, the approach to the case of f = g in [a, b), as 


presented in the text. 


. For the first part, use the result of Problem 4, page 358, together with 


Proposition 10.21. For the second part, use the result of Proposition 10.13. 
If M > Ois such that |f(x)| < M for every x € [a, b], then 


ate b 
[tears [rood 
b-€ 


a 


[peo 


=f veoars [vena 


b b-e 
[ fooa- ds Sf (x)dx 


Ss 


b 
+ | Sf (x)dx 
b-e 


ate b 
< / Madx + Mdx = 2Me. 
a b-e 


Therefore, lim. ( J? f@adx — ors f(x)dx) =; 
From Le (x) — tg(x))?dx > 0 for every t € R, conclude that Ar?’ — 2Br + 
C > 0 for every t € R, where A = ie g(x)?dx, B = SL? Fodgxdx and C = 


> ¢(x)2dx. Then, note that A = 4(B2 — AC) < 0. For the last part, use the 
di P 
result of Problem 6. 
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First consider the case in which f is positive, and look at both (2) (b—a) 


and ii (x)dx as the areas of certain regions in the cartesian plane. For the 
general case, note that changing f by f + c amounts to adding c to both sides of 
the inequality in the statement of the problem. 


Section 10.5 


10. 


. In both cases, use (10.38). Additionally, for item (a) note that x b> ix 


. If F(x) = f 


. For item (a), use that cos? t = $(1 +cos(21)); for (b), write cost = 1—2 sin? 5; 


for item (c), integrate by parts. 


. For item (a), integrate by parts; for (b), integrate by parts twice. 
. For the first integral, if m 4 n write sin(mx) sin(nx) = $(cos(m—n)x—cos(m+ 


n)x); if m = n, use the fact that sin? nx = (1 — cos(2nx)). For the second 
integral, argue in an analogous way. 

3/2 ig a 
primitive for x +» ./x in [0, +00); for (b), note that x +> tanx is a primitive for 
xr» sec? x in (—3, 3 


297: 


. Start by taking a primitive F for f and computing the given integral with the aid 


of the FTC. 
i f(t)dt, use the result of the previous problem to show that F’ 
vanishes identically. 


. Let f(x) = mx +n, compute f' ”(mxn)dx with the aid of the FTC and, then, 


check that the result coincides with the ordinary formulas for the area of a right 
triangle (if f vanishes at x = a or x = b) or for the area of a right trapezoid (if 
f is positive in [a, b]). 


. Letting F be one such primitive and a < x < y < b, use the interpretation 


of p f(t) dt as an area and the monotonicity of f to compare the differences 
F(y) —F (2) and F (2) — F(x). Then, apply the result of Problems 8 
and 9, page 338. 


. Let f(x) = + 1)" = Vireo (f)*, x € R. Compute /; f(Adt in two different 


ways to get 


3 n\ xt! _ @t+irtt-1 
C= Nkpk+ n+l - 


Then, integrate both sides from 0 to | to get 


. 1 n\  2"?_(n4+2) 1 
2 erperD i) a1 aa 


Apply Lagrange’s MVT to the primitive of f based at the point a. 
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ioe 


12. 


13. 


First of all, use Corollary 10.33 to get 


b x=b b b 
/ foyrar = fe"| / Oxf (xf (dx = — / Oxf (x)f’ (xd. 


Then, apply Cauchy inequality for integrals (cf. Problem 12, page 381) to x 
xf (x) and f’. 

Firstly, use the computations in Example 10.29 to show that there exists a single 
jo € F of the form fo(x) = Acosx + Bsinx, with A,B € R. Then, expand the 
integrand in the inequality iN (f(x) —fo(x))?dx = 0 to get the desired result. 
Since 0 < cosx < | for x € [0, 5], we have (cosx)"*! < (cosx)" for every 
Int ge (cos x)” Hay 

In ee (cos x)"dx 

other hand, we saw in Example 10.35 that [,4, = 


integer n > 0. Hence, < 1 for every n > 0. On the 


ai for every n > 1. 


I a a 2 
Hence, 1 > “ft = FP— = = ht & #1, so that each one of the 
hh 2 n-—1 In—2 Ih- 


sequences (Bs : 7) and ( an ) is nondecreasing and bounded from above, 
k>1 1) k>1 


thus convergent. Letting £9 and £; respectively denote their limits, we have 


fr In— =I 
£o,£; > 0. Now, again by (a), observe that al - Fe’ = (4) : 


Tn wel Tn-1 
Therefore, letting m = 2k — +00 we get i = 6) ' or, which is the same, 
fol, = 1. However, since fat < 1, it easily follows that £9, 2; < 1. Thus, 
1 = @¢; < 1-1l=1, which gives us fb) = £, = 1. Finally, since both 


sequences (BS 7) and ( me ) converge to 1, we conclude that ( es 
k>1 2k k>1 nl /p>1 


also converges to 1. 


Section 10.6 


. For item (a), use the substitution x = t — 1, together with the fact that Tr — 


p/2 = 4 (45); for (b), use the substitution x = sint; for item (c), use the 


dauiten x= Jt. 


. For (a), make the substitution y = 1 + x; for (b), integrate by parts and use the 


result of (a); for item (c), make the substitution x = sint and, then, integrate 
by parts; finally, for (d), perform the substitution x = f° and, then, integrate by 
parts. 


. In both cases, write [“ f(x)dx = ie f(x)dx + Jo f(x)dx and, for the first 


integral, perform the substitution x = —t. 


. For the first integral, note that the sine function is odd and apply the result of 


the previous problem. For the second one, argue in a similar way. 


. Assume (without loss of generality) that O(0, 0) is the center and A(—a, 0) and 


A’ (a, 0) are the endpoints of the major axis of the ellipse. It’s a well known fact 
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2 


2 2 . 
that, in this setting, the equation of the ellipse is given by + oy = |. Letting 


f : [-a,a] > R be given by f(x) = b,/1 -— x, show that the area of the ellipse 


equals 2 A ke f(x)dx. Then, perform the trigonometric substitution x = acost, 
with 0 < ¢t < z, and compute the integral. 


. Suppose that g is crescente (the case of a decreasing g in completely analogous). 


Let t) < tp < +++ < t, be the points of the interval [g(c), g(d)] in which f is 
discontinuous, and c < sj < sy <-++ < sg < d be such that g(s;) = 4 for 1 < 
j<k. Let fi : [G, 541] > R be such that fj = f in (4, G41), 4%) = lim,+fO 
and fi(tj+1) = lim,+,,,—f(d), so that fj is continuous. Use Lemma 10.17, 
together with Theorem 10.37, to get i f(@dt = J f(g(s))g’(s)ds for 


1 <j < kand, analogously, Sriof Oat = | f(g(s))g’(s)ds and 8 F (dt = 


ff i f(g(s))g'(s)ds. Finally, add all such equalities to get the desired result. 


. First of all, note that f(a) = a and f(b) = b. Hence, integrating successively 


by substitution and parts, we get 
b f(b) b 
[reoas=f[ Pea=f roorewd 
a f(a) a 
b x=b b 
=f r@a=seol_ -f roa 
b 
= bf(b) — afta) — fF) dv 


b 
— pa [ F(x) dx. 


. Start by observing that, since p has degree n, it is reciprocal if and only if 


p(=) = pe for infinitely many nonzero real values of x (here we are using the 
fact—to be proved in [5]—that two real polynomial functions which assume 
the same values for every x € A, where A C R is infinite, are actually 
identical). Now, assume the equality of the statement holds for every x > 1. 


Then, differentiating both sides with the aid of Problem 5, page 391, gives 


: 1/3 1 ‘oh G 1) — pe 
ol — — tte (- +), which is the same as p(+) = “© for every x > 1. 


Conversely, assuming p to be reciprocal, the variable substitution t > + gives 
x t 1/x 1/t 1 1 t 
/ moat = | p(1/t) (-a)a= PUD) n/2-1 94 
1 p/24+1 1 (1/1)"/2+! a ifs rf 


_ fi p@ 
~~ ve p/2+l- 
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9. 


10. 


Since f’ > 0, we conclude that f is increasing. Therefore, f(x) > 1 forx > 1 
and the FTC gives, also for x > 1, 


A 


% 1 = sa 
-1l= ——— dt >~—_dt = arctanx —- — < —. 
f(x) i PD +fe <|[ Pa arctan x A < 4 


Finally, since f is increasing and bounded above by 1 + 4, we are done. 
The change of variables formula gives, together with the semi-invariance of f 
over g, gives 


g?) (x) g(x) ; g(x) 
/ fae [o see'oa = [roar 
g(x x x 
Therefore, if g™ = Id;, then 

m—1  pgkt) (x) 


0 = / f(dt = 2 | - f(tdt 


: 1 pg(x) 


f(pjdt = mf f (dt. 


x 


Section 10.7 


1. 


4. 


Since y = log,x @ x = a’, we have log, x-loga = yloga = loga = 


logx and, ila log, x = bee on the other hand, Theorem 9.28 furnishes 
1 


log’ x — A) = Fie = sane Now, item (a) follows from the first part 
of (10.60), for, loge = 1. Item (b) follows from the second part of (10.60), 
for, log, is increasing + log’ x > 0 for everyx > 0 } Tons > 0 for every 


x > 0 } loga > 0 for every x > 0 © a > 1. Concerning (c) and (d), 


we have log, (xy) = ee = ees = ea + ce = log, x + log, y and 

I loge _ logb 
log,c = ice = ied . ae = log, c - log, b. Finally, (e) follows immediately 
from (d). 


. Taking natural logarithms, conclude that it suffices to compare — | and “£2. To 


this end, study the first variation of f : (0, co) > R given _ f@® = = logs ~, 


showing that x = e is its only point of global maximum. 


. Taking natural logarithms, show that it suffices to find out the solutions a,b € 


184 _ 108 


N of the equation 
previous problem. 
Apply the formula of Corollary 10.33. 


. To do so, proceed as in the hint given to the 
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5. 


10. 


11. 


12. 


13. 


14. 
15. 


16. 
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Let f(x) = x — log(1 + x) and show that f is increasing. Then, write f(x) = 
x (1 - elt) and use L’ H6pital’s rule to get lim,_,94 f(x) = 0. 


. It suffices to observe that, since log is increasing and = > ./xy for every 


x,y > 0, we have log (=) > log /xy = gx TRY with equality if and only 
ifx=y. 


. Since f(x) = e* + 6ax, we wish to find all a < 0 such that e* + 6ax has a single 


real root. Arguing geometrically, conclude that it suffices to find all a < 0 for 
which the straight line y = —6ax is tangent to the graph of x +> e’. To this end, 
use the result of Problem 11, page 301 (with (a, b) = (0, 0)), to conclude that, if 


(xo, e*’) is the point of tangency, then e*? = —6axp and —6a = fe =e, 
x=xo 


. Recall that x* = e®!°8* and apply the chain rule. 
. Adapt, to the present case, the hint given to the previous problem to get h’(x) = 


(Fx)e/@) log fa) +f Wa) feo". 

Study the first variation of f to conclude that x = an ; is its only critical point. 
Then, use Weierstrass’ Theorem 8.26 to conclude that such a point is its only 
maximum point. 

Let a = b = V2 and analyse the possibilities a’ rational and a? irrational, 
observing, in this last case, that (a?)’ € Q. 

The continuity of the natural logarithm guarantees that it suffices to show that 


lim xlog (1 + “) =a 
Xx—>+00 x 


or, which is the same (letting y = +), that lim,_.94 


log(1+ay) 
: y 
use |’ H6pital’s rule. 


= a. To this end, 


x X 10; ata . _ . 
Since (44)" = e’ (2) we wish alla € R such that lim, +90 x log ( 
1 or, which is the same, lim,—.+o0 x log ( an ) = 1. Letting y = +, conclude 


tos( 12%) 


¥ 


=) = 


x—a 


IS 


+15 


that we wish to find all a > O such that lim,_.94 
1’ H6pital’s rule. 

Differentiate g(x) = e~*f(x) and then apply Proposition 9.57. 

For (b), use the result of (a), together with the FTC. Finally, for item (c), we 
have [ sec* tdt = f secr(1 + tar’ ndt = f sectdt + f(tant- sect) tantdt = 
—log| sect — tant| + f sec’r-tantdt = —log| sect — tanz| + sect - tant — 
f sect + tan’ tdt. 

For the second part of (a) use the chain rule, together with the fact that fe = 
e*. For item (b), use that cosh” x = coshx > 0, coshx = ote > Je-e* = 
1 and lim,_,4.. coshx = +00. For (c), show that sinh’x = coshx > 0 and 
sinh” x = sinhx, with sinhx > 0 if and only if x > 0. In item (d), you should 
get something similar to Fig. B.8. For item (e), letting k(xo) denote the curvature 
we wish to compute and f(x) = coshx, conclude from Problem 3, page 344, 


— cosh xg cs 
that k(xo) ~~ (1-Fsinh? x9)3/2 ~~ cosh? x9 ” 


= |. Finally, apply 
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Fig. B.8 Graphs of the 
hyperbolic sine and cosine 


17. 


18. 


19. 


20. 


Since x? —y? = 1 Sy = +vV x? — 1, we have 
1 cosht 
A= / (tanh t)x dx + / ((tanh t)x — Vx? — 1)dx 
0 1 
1 1 é 
= 3 tanht + (tanh f) - 3 (cosh? — 1) -| V cosh’ s — 1 sinh sds 


1 1 t 
5 sinh t- cosht — an (cosh(2s) — 1)ds = 5 


Write x* logx = 


=e, with y = +. Then, adapt the proof of Theorem 10.51 to 


the present case to conclude that ae <= = 0. 


Rework the proof of Theorem 10.55 to get 


n 


Firstly, note that f(1) = log(3e/8) > 0, by Problem 8, page 242. Now, write 
f(x) = log (42) —log (1+ +)" + 1 to get lim, 400,f(x) = 0, and conclude 
that it suffices to show that f’(x) < 0 for x > 1. For what is left to do, compute 
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f@= aa + log (<5). f’@ = = - REarST Es and conclude that f” is 
increasing, with limy-, 460 f’(x) = 0. 

21. Apply Stirling’s formula. 

22. Use Stirling’s formula to get 


(K+ 1)n 1 =a wf, yk 
( )- ———(k + "(i+ 5) 


n 27mn 


Then, note that k > an > k > +c00 asn > +00. 


Section 10.8 


1. Note that a = 2 and a = 4 are two roots. To show that there are exactly three 
real roots, start by sketching the graphs of x > x* and x +> 2*; then, use the 
IVT to show that there exists xy) € (—2,0) such that a, = 2”, Finally, assume 
that xo is rational and derive a contradiction. 

2. Firstly, note that f(x) = log (x + x? + 1) is increasing in R, so that the given 
system of equations can be written as 


g(x) =y, g(y) =z, and g(z) =x. 


Assume, without any loss of generality, that x = max{x, y,z}. Then, succes- 
sively conclude that f(x) = max{f(x), f(y), f(z}, y = z,x and y = x. Arguing 
in an analogous way, show that x = y = zand solve log(x + Vx* + 1) = 0. 

3. By the sake of contradiction, differentiate the equality p(x) = log x and, then, 
use the results of Example 9.17. 

4. Let k € Z4 be such that 10* < n < 10**! and conclude that k = |log,, 7}. 

5. We want that 


<pk< 
(a\az...Am00...0)10 Sn = (ajd2...dm99...9)10. 
1 


for some n,/ € N. This is the same as 10!p < n* < 10/(p + 1), where 
DPD = (a\a2...4m)\o. Take 1 = kq and show that, in this case, the last 
condition above is equivalent to the existence of a natural number n such that 
n € [10% 4/p, 107X/p + 1), for which it suffices to have 107(4/p + I— ¥/p) > 1 


or, which is the same, g > — log,)(/p + I — ¥/p). 
6. Start by showing the validity of the inequalities log, (log, b) > log.(log, b) and 


log, (log..a) > log. (log, a). Then, apply item (e) of Problem 1, page 411. 

7. Use the given functional equation to show that, if a9 = 0, bo = 0, an41 = 
2a, + 1 and by4) = 3b, + 5, then f(a,) = b, for every n > 0. Then, use the 
material of Sect. 3.2 to conclude that f(2” — 1) = 3(3” — 1) for every n > 0. 
Finally, solve for n the equation 2” — 1 = x. 
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8. Start by showing that f(0) = 0, 1 or 2. In the first two cases, show that f is 
constant. In the third case, show that f(m) = 2m whenever m is a power of 2. 
Then, take n < m and suppose that f(m) = f(n); use induction to show that 
f(m") = f(n™) and, to reach a contradiction, choose a,b,c € N such that 
logsn< £< £4 <logym. 

9. Initially, use the formula of integration by parts to get 


1 xt 1 nx?-1 
n dx =n xX: dx 
o +a 0 x"“+a 


a 
a , x- — log(x" + a)dx 
0 dx 


1 
= log(a+ 1) -{ log(x” + a)dx. 
0 


Now, observe that 


1 
loga =a log(x” + a)dx 
0 


1 x" 
= | (Ioga + tog (= +1)) as 
0 a 
1 yn 
x 
<loga+ [ —dx. 
o a 


Finally, take together the two computations above to compute the desired limit. 
10. Use the power means inequality. 
11. Apply Jensen’s inequality to the natural logarithm, as in Example 10.61. 
12. Apply Jensen’s inequality to f(x) = xlogx, x > 0. 
13. Apply Jensen’s inequality to f(x) = log iz, 0<x<l. 


14. Apply Jensen’s inequality to f(x) = log se x € (0,7). 
15. If a; = e*, with x; > 0, show that f(x) = sh, 
apply Jensen’s inequality. 


16. Start by writing the given inequality as 


=. tt 1 Ss 
ear a 
l—-a~ nt! 1-s 


i=1 


x => 0, is strictly convex and 


where s = a, + a2 +-+++ a,. Then, rewrite the inequality as 
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If a; < 5 for 1 < i <n, apply Jensen’s inequality to f(x) = log(-4),0 <x < 
3 subsequently, show that 


nog (—— -) + (n+ 1) logn < log (— 


17. LetS = (i+ bit)" and write 


= 0 +b)! + 3 bi(a; + bE. 


i=1 i=1 


Then, choose g > 0 such that 7 1] - z | and apply Hélder’s inequality to each 
of the sums above. 


Section 10.9 


1. For item (a), use the result of item (b) of Problem 15, page 412. For (b), 
apply the comparison test for improper integrals, together with the fact that 
le‘ sint| < e~ for every t € R. For item (c), note that x — 0 when t > 0+ 
and apply the comparison test whe? t — |-. Finally, item (d) follows from the 


fact that arctan is a primitive for ; a: 


2. Apply Theorem 10.37 to compute eG dt for x > B. Then, let x > ++oo. 

3. Adapt, to the present case, the argument of Example 10.70. Alternatively, apply 
the change of variables mere for improper integrals (previous problem) with 
the variable substitution t +> —, that maps (0, 1] bijectively to [1, +00). 

4. For a > 7”, the change of vaHables formula, together with the formula for 


integration by parts, gives 


a Ja 
/ cos(x*)dx = / t'/? costdt 
x ca 


va 1 v4 
=f !/? sint + >| t>/? sin t dt 


7 apt: 


sin ./a i. 
= +2 
a 2 In 


Now, argue as in Example 10.75 to show that ae ah dt converges. 


5. For the convergence of the integrals, note first that (cf. Problem 4, page 325) 
sinx > x— x“ for x > 0. Then, for 0 < x < /6, 
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0 > log(sinx) > log (x- “) = logx + log (1 = =), 


Since both integrals de log x dx and is log (1 = de converge, the same is 


fe 3 : . 
true of /," log (« - =) dx. Given 0 < a < B < V6, it follows from the above 
inequalities that 


B B x3 
|| log(sin x)dx| < [| log (x-=)ar. 


Therefore, . log(sin x)dx converges. Now, use the change of variables formula 
to show that 


/ log(sinjax = [ log | cos x|dx, 
0 0 


so that Pi log | cosx|dx also converges. Finally, let J = i log(sin x)dx and 
using the change of variables formula again, we compute 


2n 1 
21 = 2 | log | sinx|dx = 2 | log | sin(2y) |dy 
0 0 
= | log (2 sin y| cos y|)dy 
0 
= 27 log2 + 27+ 2 | log | cos y|dy. 
0 


Then, /," log | cos y|dy = —z log 2. 


6. Apply the integral test to each of the functions x > nd 


1 i 
(log x) ’ ae x(log x)” a 


1 
x(log x) (log log x) * 
7. First recall that, for some A > 1, we have logt < /t for t > A. Therefore, 


logt vt vt _ 4-3/2 +00 3/2 
ne Sie <p = for t > A, and hi t_~/“dt converges. Now, take 


x > 1 and use the result of Problem 5, page 391, as well as Corollary 9.47, to 
show that f : (0, +00) > R, given by f(x) = /, ifs pesdt, is constant. Deduce 


that Sins esdt = 0 and, then, that i 2 oe dt converges. Finally, compute 


+oo logt dt=0 
0 1-e00 7 
8. For item (a), apply Young’s inequality (cf. Proposition 10.62) toa = f(t) and 


b = g(t) and then integrate the result on J. For (b), start by observing that we 
can assume that Mow dt > 0 and via g(t)4 dt > 0. Then, let f, = f/A and 


1/p 1/q 
21 = g/B, where A = (fe 00" dt) and B = ( g(t)4 dt) . Finally, show 
that Fh A@? dt = f : gi(t)? dt = 1 and apply the result of item (a). 
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9. 


10. 
11. 


12. 


1 ee oe 
eee = 2 
< 0, so that f is indeed decreasing; on the other, litt. s ei) = 


For item (a), on the one hand we have f’(x) = 


1 
x(x+ 1) 
lim,++00 (4 — log (1 + +)) = —log1 = 0. For (b), given x > 1 and using 
the result of Problem 4, page 412, together with Theorem 10.53, we get 


[ toa = - (+ -toe( + 1) + 17) dt 


t=x (=x 
= log, = (+ Dloge+ D- + I) 
= — 
t=. 
+ (tlogr—0| 
t= 


1 1\ 
=-1e (1+ =) -toe(1+ =) + 2log2+1 
x x 


xX—>+00 


—> —logl— loge + 2log2+4+ 1 = 2log?2. 


For item (a) adapt, to the present case, the proof of the integral test. 

For item (a), argue geometrically to show that if 27-4 <n<nt+l< ae: na 
then 2in + 2 <n+2<2In+ , and if 2izr + > Ep cnel <2 
then 2/7 + = <n+2 < 2ln ‘ un . For item (b), use (a) to conclude that 


| sin n| | tei | sin(n+2| 
n + n oF n+2 pa Ter 


series to show what is asked, Finally, for item (c), for each k € N show that 
1 k+1 m-% 

ee lat heag let = gee 
asked again from the divergence of the harmonic sie: (Why can’t one use the 
integral test?) 
For item (a), study the first variation of f(x) = (1 — x)e* in the interval 
[0, 5]. For (b), use the Fundamental Theorem of Arithmetic (cf. introduction 
to Chap. 1). Item (c) follows from items (a) and (b), together with the fact that 


a use the divergence of the harmonic 


=, Then, conclude what is 


-1 
1+ 7 + a ee (1 - 1) . Finally, for item (d) use the divergence of the 
Dy D, 


harmonic series. 


Section 10.10 


1. 


2 


Setting f(x) = coshx in (10.71), we have v1 + f’(x)? = coshx, so that the 
=X0 


desired length equals J, ~ coshxdx = sinh x} = sinh x9. 


ees (a) ahs sedate from (10. 71. "For item (b), use the fact that 
a = b’ + c’, together with the fundamental relation of Trigonometry to write 
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b2x2 b?cos?t =a sin?’ t + b* cos? t 


ia Si 
a*(a2 — x?) a sin’ t a sin’ t 


b? + c? sin’ t b \? Ce.) 
= ——_—_ = - 1+ (=) sin’ t}. 
a? sin’ t asint b 


3. For item (a), observe (cf. Fig. 10.12) that after I has rolled to the right by t 
radians starting from O, the point of tangency between I and r will be T(t, 0) 
and the center of I will be A(t, 1). On the other hand, angle ZTOP, measured in 
the counterclockwise sense, will be equal to ¢ radians. Since BAP = a —f¢ (in 
the counterclockwise sense), it’s immediate that 


x(t) =1 +005 (2-1) =t—sint, y(t)= 1+ sin (= -1) = 1-cost. 


For (b), first note that 


20 20 Qn y(t)? 
: VI +f'(x)2dx = i V1 4+f' (x(t))2x (dt = : 1+ x’ (t)dt 


x! (t)? 


2n Qn 
= / Vx (1)? + y(t)? dt = v3 | V1 —cos tdt. 
0 0 


For the computation of the last integral, see item (b) of Problem 1, page 397. 

4. By definition, we have r(5) = ta e't—'/? dt. Now, perform the substitution 
of variable s = t'/? and apply the result of Problem 2, page 435. 

5. We start by using the result of Problem 18, page 413, to conclude that there exists 
A > O such that ¢2|logt| < A for 0 < t < 1. Then, observe that 


1 1 1 
i et '|log¢| a= | et?! . £2|logt| a<a | e't2 | dt. 
0 0 0 


Now, use the fact that logt < t fort > 1 to write 


+00 +00 FRO 
i ef] log t| dt = / ef! logtdt < it er dt. 
1 1 1 


Finally, gather together the above estimates to obtain 


+00 1 7 +00 
/ er '| logt| dt <a f etas f e ‘ft dt 
0 0 1 


+00 7 +00 
<a [ e'f2! a+ | er dt 
0 0 


= Ap () +T@t1)) 
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6. 


For the second part, let n; = j for | < j < k. For the first part, use the fact that 
log T is convex, together with Jensen’s inequality. 


Section 11.1 


_ 


Nn 


. Use version (11.2) of the Taylor’s formula of f centered at xo, together with the 


fact that f” > 0. 


. Assume f(x) > 0 (the other case is entirely analogous). Since f is 


continuous and J is open, we can take an open interval J C J, centered at x) and 
such that f (x) > 0 for every x € J. Fora fixed x € J \ {xo}, (1. ) guarantees 
the existence of a real c between xo and x such that f(x) = f(xo) + ii “(0 (x—X0)". 
Conclude that f(x) > f(x). 


If f@® = V1l+«sin*t forO < t < a, then f’() = eaten) Now, 


(1+« sin? 1)3/2 
look at (10.74) and substitute the formula for f in both sides of the inequality 


of Example 11.2, noticing that |f’(0)| < «(k + 1) forO<t<7z. 


. The Taylor series of the exponential oe together with the definition of 


sinh x, furnishes sinhx = $(e* — ey=4 a ee a a ak x = = ys aye. 
Argue analogously to cosh x. 


. Apply the result of Proposition 11.3, with n + 1 in place of n. 
. Let p(x) = ayyx" + nx"! +++++ a,x + a9 and assume a,, > 0 (the other case 


can be dealt with in an analogous way). By Example 9.17, we can take A > 0 
such that p(x) > 0 for x > A. Then, for x > A, apply (11.6) with n = m+ 1 to 
get the desired result. 


. For item (a), since f(B),f’(B) > 0, we clearly have y < f; now, use the result 


of Problem | to conclude that f(y) > 0, and hence y > q@. For (b), use the result 
of (a) to show that (a,),>1 is decreasing and such that a, € [a,a,] forn > 1, so 
that there exists £ € [a, a)] satisfying a, > & as n — +00; then, let n > +00 
in the recurrence relation defining (a,),>1 to get £ = a. For (i), use (11.2) to get 
f(a) =f (qn) +f’ (an) (@ — an) + sf" (En) (ao —adn)* for some &, € (a, dy). For the 
equality in (ii), use again the recurrence relation of (a,)n>1; for the inequality, use 
the fact that f’ is increasing. Finally, for the first part of item (iii), use the result 
of (ii); for the second part, make induction on n, noting thatO < a,-—a <d-—c. 


. Verify that f’,f” > 0 in [2, 3] and max 5) a = 2. In the notations of item 


(i111) of the previous problem, conclude that we can let c = 2 andd = 3 to get 


it 5 2an+5 
0<a—a < (2) m for every n > 1, where a; = 5 and dn+1 = a= 


5: Finally, 
note that Cy. a < 107°. 
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Section 11.2 


1. We saw in Example 11.11 that (th y< x for every x € R. Hence, letting 
M, = x se and noting that visi M; < +00, it follows from Weierstrass M-test 


that dos (a) converges uniformly in R. Now, Proposition 7.38 gives 


142) 1-—-) -x41° 


kel T2 


2. In all cases we shall use Weierstrass M-test. For the Taylor series of the 
: ; es ' 
exponential function, we have la*| < & if |x| < a. Therefore, letting 
k k 
M. = & fork = 0, we have eso Mk a Peon = e* < +o, and the 
M-test guarantees the uniform COnVETER Ce i the sence in the interval [—a, a]. 
(<p! 32i- ai 
(Qj- me = cm 


Mz; = 0 forj > 0 and Mj) = i a for j = 1, we get Dis9Mk = 
2) 


Vere = Ver i = sinha < +00. Again, the M-test assures the 
uniform convergence of the Taylor series in the interval [—a, a]. Finally, for the 
Taylor series of the cosine function a similar argument is valid. 

3. For the non uniform character of the convergence, show that the maximum 


For the sine function, we have | if |x| < a. Letting 


value of f;, is e~', regardless of the chosen n. In order to compute i Tn(x) dx, 
integrate by parts. 

4. For item (a), note that x” —"+ 0 for 0 <x < 1. For (b), integrate by parts to get 
fo fal) dx =—e!+ n fy fr) dx for every n € N; then, write fhe 63 dx = 

e!, i Fn-1 (x) dx = an} —b,—,e~! and use the fact that e~! is irrational. 
For item (c), make induction on n, using the recurrence relations found in (b). 
Finally, for (d), use the fact thate =), G- 

5. First show that the definition of uniform convergence guarantees the existence 
of C > 0 and no € N such that n > no => |f,,(x)| < C for every x € [a,b]. 
Then, conclude that |f(x)| < C for every x € J, so that f is bounded. To show 
that f is integrable let € > 0 be given and choose mp € N such that n > no 
implies |f,(x)—f(x)| < ¢€ for every x € I. Fix such ann and invoke Theorem 10.4 
to choose a partition P, = {a = x) < x1 <-+++ < x, = b} of the interval 
[a, b] such that S(fr; Pn) — 8(fni Pn) < €. Noting that mj(f,) < fr) < Mfr) 
for x € [x;-1, xj], conclude successively that mj(f,) —€ < f(x) < Mj(f,) + € 
for x € [x-1, xj], mj(fn) — € < m(f) < M(f) < Mf.) + € for 1 <j < k and 
S(f; Pn) —s(f; Pn) < €(1+2(b—a)). Finally, let I, = f? fx(dx, 1 = f? fax 
and show that computations analogous to the ones above furnish J, —I < «(1+ 
(b —a)) andl, —I > —e(1 + (b—a)), so that |J, —I| < «(1+ (b-4)). 
Alternatively, let D,, be the set of points of discontinuities of f,, and D that of f. 
Show that D C ,,., D, and apply the result of Problem 3, page 367, together 
with Lebesgue’s Theorem 10.10, to show that f is integrable. Then, argue as in 
the proof of Theorem 11.14 to show that te Sf (x)dx = limy-++00 Pi Tn(x)dx. 
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Start by using Lagrange’s MVT to get |fc(x)| < [fc(x) — fk (xol + |fe(xo)| < 
My|x — xo| + [fe(xo) | < Mi(b — a) + |fi(xo)|. Then, apply the M-test with M;, = 
M,(b—a) + |fi(xo)| in place of M; to show that }°,. | f¢ converges uniformly in 
[a, b] toa function f : [a,b] > R. The M-test also shows that )~,.. , f/ converges 
uniformly in [a, b] to a function g : [a,b] > R, and we claim that f’ = g. To 
this end, let xo, x € [a, b], with x # xo, and S = |=o) _ g(x0)}. With the aid 
of Corollary 11.19, write 


_ ae -¥ f soar 2(x0)| 
k>1 2. J., 
“lal, cana _ [0 aay 
= x — Xo| | [ ag) _ a(x)|dt|. 


Now, use the continuity of g to show that this last expression can be made less 
than €, provided x € [a, b] is chosen to be sufficiently close to xo. 


. Use the given condition to conclude that pe f(@)p(x)dx = 0 for every real 


polynomial p. Then, use Weierstrass approximation theorem to find a sequence 
(Pn)n>1 Of polynomials such that p,, me f uniformly on [a, b]. Finally, conclude 
from this fact that [” f(x)dx = 0. 


. If such an f did exist, we would have ie f@p()dx = p'(0), for every 


polynomial function p. Now, if F(x) = i f(ddt, then F’ = f and F(0) = 
F(1) = 0. Integrating by parts, we get i, F(x)p'(x)dx = —p’(0), which is the 
same as [; . F(x)q(x)dx = —q(0), for every polynomial function g. Then, follow 
the hint given to the previous problem to get 


1 1 
/ F(x)?dx = lim / F(x)gn(x)dx = — lim q,(0) = —F(0) = 0, 
0 n>+o0 Jo n—>+oo 


so that F, and hence /, vanishes identically. But this contradicts the fact that 


is f(x)xdx = 1. 


. Given € > 0, use Weierstrass approximation theorem to choose a polynomial p 


so that |f(x) —p(x)| < € for every x € [0, 1]. Note that lim,_, +95 to P(X) Gn(x)dx 
does exist does exist and estimate 


1 1 1 1 1 
| / SQm = ; Sn Ss | €(Qn + Qn) + | / PPm = / PPn . 
0 0 0 0 0 


Now, use the fact thatn > : @, and n b> i Pn converge, together with 


the computations above, to show that n b> bi F@n is a Cauchy sequence. 
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10. 


LI; 


12. 


13. 


14. 


15. 


16. 


For item (a) and for x € (0, 277), use the result of item (a) of Problem 10, page 
359, to get 


(n—1)x \ os (nt+1)x 
sin (Ate 5 ~ sin — 


: sin(kx) = 7 
ny 


Then, conclude that hae = sin(kx)| < Te 


(b), note that for 0 < 8 < m and x € [6, 2 — 4], one has | sin 5| = sind. 

For the first parts of items (a) and (b), integrate by parts. For the second 
part of (a), use the triangle inequality for integrals to get |az(f)|, |bi(f)| < 
xo = \f(x)|dx for every k > 1. For the second part of (b), use again the 
fongle inequality for integrals, together with the M-test. Finally, for item (c) 
use Corollary 11.19, together with Problem 3, page 390. 

If a,(f) and b;,(f) are as in the statement of the previous problem, compute 
the integrals that define such numbers to get ag(f) = 2a" ,a = cy and 
by = 0 for every k € N. Then, use Fourier convergence theorem to write 
fw= mart + Ves &(A) cos(kx). Finally, compute f (77) = a” with the aid of 
such a series. 

Again in the notations of Problem 11, compute the appropriate integrals to find 
ao(f) = 1, an(f) = 0 forn > 1, by (f) = 0 fork > 1 and by_\(f) = aT 
for k > 1. Then, evaluate the Fourier series of f at oe with the aid of Fourier’s 
convergence theorem. 

For item (a), apply the change of variables formula to g, followed by Riemann’s 


Theorem 10.7. For (b), write 


and apply Abel’s criterion. For item 


n B 


Dp 
[Preoemmar= > f, yf CD8LODAe 


k=1 
Then, given € > 0, use the uniform continuity of f to find ng € N such that, for 
n> No, 


® 


(ro -1(®)) eempa 


(k— ep 


Pp 
< | |g(nx)|dx < ep( max isl). 
0 [0.p] 


First of all, it follows from Problem 6, page 391, that a(f) = + fo" FR) cos 
(kx)dx. Then, use the result of the previous problem. For b;,(f), argue in a 
similar way. 

Since a(f) = ag(g) and by(f) = bye(g) for every k, we have Sif = Sxg for 
every k, so that 


1 n 1 n 
Onf = nae S— Sif(@) = nae S > Sig(x) = On§ 
k=0 k=0 


for every n € N. Now, Fejér’s theorem guarantees that o,f — f and ong sy g 
(even) uniformly on R, so that o,f = o,g implies f = g. 
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We are going to show that f is a constant function. To this end, first note that if 
g(x) =f (+), then g is continuous and periodic of period 27, and f is constant 
if and only if g is so. Note also that g(x) = g(x+2m V2) for every x € R. Now, 
let 9 + esi (ax cos(kx) + bx sin(kx)) be the Fourier series of g. Using the 
change of variables formula, together with the result of Problem 6, page 391, to 
get, fork => 1, 


Ta, = a g(x) cos(kx)dx = [ g(ixt+ 2nV2) cos(kx)dx 
n—2nJ/2 
= i g(x) cos (k(x —2n V2))dx 


—n—2n/2 


= [ g(x) cos (k(x —2n V2))dx 


II 


ia g(x)( cos(kx) cos(27rkV/2) + sin(kx) sin(2kv/2))dx 
= ma, cos(2akV2) + rb, sinQarkV2). 


Analogously, show that by = by cos(2kV/2) — ax sin(22k/2), and solve the 
linear system in a, and b; thus obtained to get a, = by = 0 for every k = 1. 
Finally, use the result of the pes problem to show that g is constant. 

Since f(xn,t) — f(xo,t) and £ On, th > of 7 (xo, t) asn — +00, we have 


lf (xo, 


of th f(x, f) andthe x (x, t) (for each xo € J) follow from the comparison test 
for improper integrals (Proposition 10.73). For the rest of item (a), fix xo € J and 
let (Xn)n>1 be a sequence of points in /, converging to xo. Let gn(4) = f(%n, 1) 
and go(t) = f(x, 1), so that go is the pointwise limit of g,. Apply Lebesgue’s 
DCT to the g,,’s and go to show that F(x,) — F (xo) asn — +00. For the rest of 
(b), first note that if a ” exists and is given by the right hand side of (11.18), then 
item (a) (applied to £ , instead of f) guarantees that F’ is continuous. Now, let 
(Xn)n>1 be a sequence of points in J \ {xo}, converging to x9. Lagrange’s MVT 
assures the existence of &, between x, and xo such that 


L(x, p| 2 G(t) for every t € J. Then, the integrabilities 


Fn) — FQ) _ a fn) ~ FR.) 5, 


Xn — X0 


B 
af (E,, t)dt. 
ox 


Xn — X0 


Let G, (ft) = #E,, t), Go(t) = £ a, t), so that Go is the pointwise limit of G,,. 
Apply Lebesgue’s DCT to the G,,’s and Go to show that 


F(&n) — F(xo) is +f" © (xo, tt 


Xn — Xo 
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18. Use the result of the previous problem, together with Problem 5, page 445. You 
may wish to use the following facts: if (x,)n,>1 is a sequence of positive reals 
that converges to the positive real xo, then (x,),>1 is bounded; in turn, show that 
this guarantees the existence of integrable functions go, Go : (0,1] — R and 
21,G, : [l, +00) > R such that er"! < go(t) and er"! logt < Go(#) in 
(0, 1], whereas e~'r"—! < g1(t) and et"! logt < G,(f) in [1, +00). 


Section 11.3 


1. For items (a) and (b), use the result of Corollary 11.24. For item (c), note that 
the radius of convergence of the series is at most 1, since it doesn’t converge 

k 
when x = 1. Then, note that for |x| < 1 we have )>,.)x* < +00, so that 


=e 1&2 is absolutely convergent. 


2. For |x| < 1, note that }°,.5 ie = Sk 4 < +00. On the other hand, for 
|x| = a > 1 and sufficiently large k, 


k k 2 2. k 
x a ak ak ak k 
Were a ee —> +00; 


hence, the general term of the series doesn’t go to 0 as k increases, and the 
series diverges. Therefore, its radius of a is 1. 


3. TEf(@®) = Viet ce! and g(x) = yet are x°*, then f and g are defined 
in the whole real line, and Theorem 11.27 gives f’(x) = g(x) and g’(x) = 
—f (x). However, since f(0) = 0 and g(0) = 1, it follows from Example 9.48 
that f(x) = sinx and g(x) = cosx. 

4. Apply Theorem 11.27 k — 1 times, starting from — a ei x”, Alternatively, 
expand (1 — x)~* with the aid of the binomial theorem. 

5. Expand (1 — x*)~!/? in a power series with the aid of the binomial theorem. 
Then, use item (b) of Proposition 11.26, together with the result of Problem 4, 
page 300. 

6. For item (a), observe that log ( )= = log(1 + x) — log(1 — x) and apply the 


ste 


— of Example 11.30. For item (c), we have = 3 forx = 5. Substituting 


1 

2 
a 1 1 1 1 

x= 5 in the formula of item (a), note that 4 stagteastan tap tom 


give 5 log 3 with four correct decimal places, since 


1 1 1 igi. U4 
981g 285 aay <a (grtget gt) 
4 Io 4 
~ 3-43-21 ~ 30-2000  60000° 
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7. For item (a), adapt the idea of the proof of Abel’s identity (5.19). More 
precisely, noting that a, = rx, — rp41, we have 


m 


Yi ae = Do aR = Dia - ety" 


k=n k=n k=n 
m m m m+1 
k k k k-1 
=) iy — resi = ony — > rey 
k=n k=n k=n k=n+1 
m m 
k k-1 
= + s Thy — a rey — Tm" 
k=n+1 k=n+1 
m 
n m k k—-1 
= (ny —Tm+1y )+ > rey’ =y ). 
k=n+1 


For item (b), since |y| < 1 we get 


m 
vax" S|" + |fm| + > Irel|y* — "| 


k=n k=n+1 


= 


m 


+ ITm| + > (sup [ri ) (v5 — y*) 


k=n+1 k>n 


IA 


r, 


= 


= |r, 


= 


4 [p(t (sup Iril) 0" _y"), 


k>n 


The first part of item (c) now follows from the fact that, on the one hand, |y| < 1 
gives 


m 
vax < [Pal + |rm| + (sup |ril); 
k>n 


k=n 


on the other, r,, => 0, Finally, for the second part of (c), note that the first part 
guarantees that f extends continuously to [0, R]. 


8. In Example 11.30, we saw that log(1 + x) = )iiso Eur etl for |x| < 1. Since 
yee cy converges (cf. Example 7.50), it suffices to use the result of the 
previous problem. 

9. Adapt the hint given to the previous problem, this time using (11.23). Alterna- 


tively, start by showing that 


1 
14+ x2 


(—1)"x?" 


=l-— Ph Ae earecree -] n—1y2n—2 4 
x x x (-1) ae 
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for every real x. Then, integrate both sides of the above inequality from 0 a 1, 
noticing that 


1 (—1)"x" 


0 1+ x 


1 x2 1 1 
x sf ee ; 
0 1+ x 0 2n + 1 
finally, let n + +00. 


10. For item (a), just note that so > s; > s; > ..., so that limj++4o sj; = inf{s;; 
j = 0} € RU {+400}. For item (b), assume first that lim sup a, = M € R. Then, 
in the above notations, M = inf{s;; j > 0}, so that, given e > 0, M+ € is no 
longer a lower bound for {s;; j > 0}. In turn, this implies the existence of j ¢ N 
such that s; < M + e, and thus a; < M + e for every / > j. This means that 
only aj, daz,..., a ;—; can be greater than M + e. On the other hand, the fact that 
M = inf{s;; j > O} gives sup{a;, aj41,...} = s; => M, and hence M—e is not an 
upper bound for {q;, aj+1,...}, so that there exists j; > j such that a;, > M—e. 
By repeating this argument with s;, +; = sup{aj,+1,@j,+2,-..}, We get jo > ji 
such that aj, > M — e. Then, proceeding by induction we construct a whole 
sequence j; < j2 < ... of naturals such that a;, > M — ¢ for every / > 1. The 
converse of (b) can be established in a similar way. 

11. For item (a), use item (b)ii. of the previous problem. For item (b), first show that 
lim sup */|azx*| > and then use item (b)i. of the previous problem. Finally, since 
items (a) and (b) show that the series converges in (—R, R) and diverges outside 
of [—R, R], we conclude that R is its radius of convergence, thus establishing 
(c). 

12. For item (a), Theorem 11.27 guarantees that xf’(x) = >> 


ii 
jel Wx” forO<x <1. 
Moreover, since 


ai 2) Ss x 4241 4 42 + age 


’ 


we conclude that 


2 
a 2i +1 Y+2 og yettry _ xk = # 
PX) > DI path ET pee tx ) > —_ 
j21 k>2 
Integrating the inequality f’(x) > +, along the interval [0, x], with x < 1, we 


get the desired inequality. 


Section 11.4 


1. Note that a) + dy +++: + a, = mif and only if xx” ...x% = x”; hence, 
conclude that there are as many solutions (a1, a2,..., a,) of the given equation 
as ways of getting a summand x” in 
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f@=OA4txt2?+--)dtet ete). texte t--). 
a tle 


k 


Next, show that f(x) = To Sei (“""")x" for |x| < 1, where in the 
last equality we used the result of Problem 4, page 483, to get the power series 
expansion of (1 — x)~*. 


. Proceeding as in the hint given to the previous problem, we want to compute 


the coefficient of x°° in 
(Lt xt te JQP HO te \Ltx tr tee) txt te +47). 


Letting f(x) denote such an expression when |x| < 1, conclude that f(x) = 
GP 4a a pee x) . ae Then, use once more the result of Problem 4, 
page 483. 


. Make induction on k > 2, the case k = 2 being given by Proposition 11.46. 
. Differentiate both sides of (11.29) and, then, make x = 1. Note that Theo- 


rem 11.27 assures we can do this. 


. Firstly, use induction to show that a, < n? forn > 2. Secondly, letting 


f@~ = Me, a,x* and noticing that the radius of convergence of the power 
series )>,.5 k?x* is equal to 1, conclude that f is defined in the interval (—1, 1). 
Now, use the given recurrence relation to show that f(x) = 2x + xf (x) + xg’(x), 
with g(x) = 74 for|x| < 1. From this point, get a closed formula for f(x), valid 
for |x| < 1, expand such a formula in power series and find a, in terms of k. 


. Adapt, to the present case, the proof given in this section to Theorem 11.45. 
. For item (b), compute 


f(x) =1+ Sat =1+ Yaw 


n=1 n>0 
=1+ Y>(Qan +n)x"t! 
n>0 
= 1+ 2xf(x) + Do(n¢ 2yx"t1 20! 
n>0 n>0 


j 2x 
= 1 + 2xf(x) + g(x) - = 


with g(x) = = For item (c), reduce both sides to a common denominator 
and, then, compare coefficients of the corresponding numerators. For item (d), 
use the result of Problem 4, page 483. 


. Adapt, to the present case, the hints given to the items of the previous problem. 
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9. Use identity (2.5) to get 


e =w+u+4+w + 3(e — ule —v)(e* —w) 


=wWt+utw te (uv + uw -4 vw) — uw. 


Let f = uv + uw + vw, g = uw? + v2 + w’ and show that f(0) = 0, ¢(0) = 1, 
f' =f +gand g’ = 2f. Then, use the result of Example 11.51 to find f and 
finish the proof. 

10. For the first part of item (a), use the results of of Theorem 11.27 and 
Proposition 11.46 to write f” + pf’ + q as a power series. For (b), start by 
using the recurrence relation of (a) to get, with the aid of the triangle inequality, 


k 
(k + 2)(k+ Dlaesal < Yo (G+ Dldellaisil + lexsllajl). 


j=0 
Then, multiply both sides of the above inequality by |x — xo|*t? to get 


k 
(k + 2)(k+ I)Ak+2 < (YG + 1) [dij — x0) MAj41 ) be - Xo 


j=0 
+ (> lox — x0) 7Aj] bx — xol" 
j=0 
«(SU + D|bx |r (Hal aly ‘lAy)r 
j=0 


_, (|x — xo| eas |x — Xo| 
+ (Seles EY ay eal 


Item (c) now follows directly from (b). Item (d) follows from (c) and induction 
on k. For second part of (e), use the recurrence relation of (d) to get 


(k + 2)(k + 1)Agyo — ACK +) kAne = Mk + 2)Ac4 1. 


Section 11.5 


. Make induction on k. 
. Write log x = log (xo + (x—x0)) = log xo + log (1 += =). Now, use the fact that 


x +> log(1+-.) is analytic on (—1, 1) and x +> =“* maps (0, 2x0) into (—1, 1) and 


Noe 


is also analytic. For what is left to do, write logx = logxo+)°,., = (x—x9)* 
_ 0 
in (0, 2x0). 
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3. Let no € N be such that ff > 0 for every x € J andn > ng; then, f™ is 
nondecreasing for every n > no. Now, given xo,x € J, the Taylor formula for f 
centered at x9 gives 


n—1 (k) 
fey = VE 


k=0 k 


(n) 
ast Oey 


for some c between xo and x. If x > xo, show that 


G=4)'—s 0. 


n=l p(k) (n) 
0< 70) - PEM —my§ < © 
k=0 . 2 


Argue analogously if x < xo. 

4. For item (a), show that f(xo), f’(xo), f’(%o), ...are uniquely determined 
by (11.39), and then apply Corollary 11.57. Item (b) is the content of 
Theorem 11.48. For item (c), let b be the right endpoint of J and suppose 
that b is not the right endpoint of J. Since p and q are analytic in J, there exists 
r > O such that (b—1r,b+7r) C J and p and q are given in this interval by their 
Taylor series centered at b. Ifa € (b - oO b), use Theorem 11.54 to show that the 
Taylor series of p and q centered at a converge in the interval (a—s,a+s), with 
s=a-—b+randa+s>b.Letg:(a—s,a+s) > R be the unique analytic 
solution of 


gs’ + ps’ + 4g =0 
g(a) = fila). 8'(a) = f,(@) 


Conclude that g(a) = 7" (a) for every integer k > 0 and then that g = f; in 
(a—s, b). Then, extend f; to JU (a—s,a+5). Finally, for (d), let J be the largest 
open interval contained in J such that (11.39) has a solution in J. Use item (c) to 
show that J = J. 
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line, 295 
Tartaglia, 38 
Tauber, Alfred, 481 
Tauberian theorem, 482 
Taylor 
Brook, 447 
formula, 447 
series, 450 
theorem of, 447 
Telescoping 
product, 84 
sum, 80 
Test 
Abel’s convergence, 243 
for improper integrals, comparison, 430 
for series, comparison, 236 
integral, 432 
ratio, 238 
root, 243 
Theorem 
Banach fixed point, 263 
binomial, 478 
dominated convergence, 460 
for integrals, mean value, 391 
intermediate value, 265 
of Arithmetic, Fundamental, 2 
of Bernstein, 510 
of Bolzano, 264 


651 


of Bolzano-Weierstrass, 213 
of Calculus, Fundamental, 351 
of Calculus, fundamental, 382 
of Cauchy, mean value, 315 
of columns, 101 
of Darboux, 325 
of diagonals, 102 
of Euler, 406, 437 
of Fejér, 470, 471 
of Fourier, convergence, 469 
of Lagrange, mean value, 313 
of Lambert, 388 
of Lebesgue, 364 
of lines, 107 
of Newton, binomial, 478 
of Pythagoras, 16 
of Rdlle, 312 
of Riemann, 360 
of Taylor, 447 
of Weierstrass, 215 
of Zeckendorf, 98 
squeezing, 218, 284 
tauberian, 481, 482 
Weierstrass approximation, 460 
Transformation of Ravi, 120 
Trapezium rule, 449 
Triangle 
inequality, 30, 32, 126 
Pascal’s, 100 
Trinomial 
expansion formula, 110 
factorisation of a second degree, 23, 24 
factorised form of a second degree, 38 
number, 110 
second degree, 23, 34 


U 

Unbounded 
from above, set, 202 
from below, set, 202 
set, 202 

Upper 
bound, least, 203 
sum, 352 


Vv 

Variable 
real, 19 
substitution, 36 

Variation, 276 
first, 321 
second, 333 


652 


Vertex of a parabola, 190 
Viéte 
formula of, 23 
formulas of, 36 
Francois, 23 


WwW 
Weierstrass 
approximation theorem, 460 
Karl, 213 
M-test of, 464 
theorem of, 215 


Y 
Young 
inequality of, 416 
William H., 416 
Z 
Zeckendorf 


Edouard, 98 
theorem of, 98 
Zero of a function, 301 


Index 


